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Preface 


In June of 1796 the Literature Gazette, published at that time in Jena, offered 
to its readers the following note (in German): 

New Discoveries. 

Every novice in geometry knows that it is possible to construct geometrically, 
i.e., by ruler and compass, various regular polygons, namely, a triangle, a pentagon, 
a 15-gon and the polygons one can obtain from each of these by consecutive doubling 
the number of its sides. This was known already in the time of Euclid and, it seems, 
that the reigning belief, starting from that time, is that the domain of elementary 
geometry does not surpass these limits: at least I do not know any successful attempt 
to expand it in this direction. Hence, the discovery that, apart from these regular 
polygons, it is possible to geometrically construct a multitude of other polygons, for 
example, a 17-gon, seems to me worth noting. This discovery is essentially a mere 
corollary of a far-reaching theory not completely finished yet. The moment this 
theory is completed it will be offered to the public. 

C. F. Gauss from Braunschweig, 
student of mathematics in Gottingen. 

The theory was completed five years later and published by Gauss in the 7th 
section of the famous Disquisitiones Arithmeticae (Arithmetical Studies ), which ap¬ 
peared in 1801. Gauss proved that if the number n of sides of a regular polygon 
is of the form n — 2 a p\ ■ ■ - pk, where the p t are distinct Fermat primes, i.e., prime 
numbers of the form 2 2 ™ +1, then the polygon can be constructed by ruler and com¬ 
pass. In algebraic language this statement means that for the numbers n indicated 
the equation x n — 1 = 0 is solvable in quadratic radicals. 

The proof of Gauss’ theorem is based on a neat algebraic theory which served 
as the cornerstone for Galois theory created thirty years after Arithmetical Studies 
was published. 

In the 7th section of Arithmetical Studies, apart from the theory of division of 
the circle, i.e., the algebraic theory of circular functions, there is a short remark, also 
by Gauss, to the effect that the method he developed is also applicable to certain 
higher transcendental functions; in particular, to functions related with integrals of 
the form J yfy. 

This remark became a starting point for the studies of Abel, who in 1827 
proved that for the same values of n as mentioned by Gauss, it is possible to divide 
Bernoulli’s lemniscate by ruler and compass into n equal parts. To do that, Abel 
had to considerably improve Gauss’ method and, what is most important, create a 
new mathematical discipline — the theory of elliptic functions. 

The theory of elliptic functions and its geometric twin — the theory of elliptic 
curves — occupies one of the central places in mathematics having unified several 
of its branches. In spite of its senior age, the theory of elliptic functions and elliptic 
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curves remains an alive and rapidly developing domain of mathematics; it is an 
inexhaustible source of techniques, problems, and conjectures for the researchers. 

In the past decade elliptic functions and curves became the subject of close 
attention by experts in such nonclassical fields as algebraic topology and quantum 
field theory; quite recently with the help of the elliptic curve theory Fermat’s Last 
Theorem was finally proved. 

The main topics of this book are the geometry of cubic curves, elliptic functions 
and their properties, elliptic integrals, addition theorems for elliptic functions and 
integrals, arcs of algebraic curves expressible via elliptic integrals, Abel’s theorem 
on lemniscate, arithmetic properties of elliptic curves, Mordell’s theorem, theta 
functions, and solutions of equations of the fifth degree. 

In other words, the book is an introductory course on the theory of elliptic 
functions and elliptic curves and is aimed at those who encounter this topic for the 
first time. However, we hope that the book will be of interest to the experts as 
well. 

The book is based on three lectures written by one of us (Yu. S.) in 1991-1992 
as part of lectures for students organized by the Moscow Mathematical Society. 
The material of the book was collected for the optional course given by the second 
author (V, P.) in 1992-1993 at the Independent University of Moscow. 

In writing this book we used a vast selection of literature, both classical treatises 
and various modern papers. We were greatly influenced by a remarkable paper 
by M. Rosen from American Mathematical Monthly [C14] that contains a modern 
proof of Abel’s theorem. We have also borrowed a lot of useful facts from wonderful 
books of Husemoller [BIO], Koblitz [B12] and Stepanov [B22]. 

The book does not assume from the reader any knowledge beyond the limits 
of beginning courses of mathematics majors in universities and is oriented to the 
widest range of readers: students of mathematics and physics, teachers, and even 
high school students. We hope that, having been acquainted with the subject of this 
book, the reader will be able to feel the charm of the fine art that we experienced 
while deciphering the works of old masters. 

While the book was being written V. Prasolov benefited from a grant from the 
Russian Fund for Basic Research (95-01-00846). 



CHAPTER 1 


Geometry of Cubic Curves 

§1.1. Addition of points on a cubic 

A plane algebraic curve is the set of points (#, y) E K 2 satisfying the equation 
f(x,y ) = 0, where f(x,y) is a nonzero polynomial. 

On certain plane curves, there exist natural laws for addition of points. For 
example, such laws exist on any straight line and on the unit circle # 2 + y' 2 — 1. To 
be able to add points of the line, one should fix a point O on i t an d t hen the sum 
of the points X and Y can be defined as the point Z such that OZ = OX + OY. 

It is natural to define the sum of the points (cos a, sin a) and (cos/?, sin 0) on 
the unit circle to be the point (cos(ai + / 3 ), sin(a + /?)). This law of addition of 
points can be geometrically interpreted as follows. Let E be the point (1,0), A 
and B arbitrary points of the unit circle. Let us draw through E the straight line 
parallel to the straight line AB\ the newly drawn line intersects the circle at the 
point C. Let us define the sum of the points A and B to be C. 

In this form this definition works for any conic (a second order curve). Namely, 
fix a point E on a conic and consider the point at which the straight line drawn 
through E parallel to AB intersects the conic for the second time to be the sum 
of the points A and B. The commutativity of the operation obtained is obvious; 
the point E serves as the zero element. To find the element —A, one should draw 
through A the straight line parallel to the tangent at E. Only the associativity 

(A + B) + C = A+(B + C) : 

is unclear. To prove it, denote the points A+B and B+C by P and Q, respectively. 
The associativity is equivalent to the following statement: If A, B, C, E, P and 
Q are points on the conic such that AB || EP and BC || EQ, then AQ || CP. 

This statement is a particular case of Pascal’s theorem on a hexagon inscribed 
in a conic. 

Examples, a) For the parabola y = x 2 with the fixed point E = (0,0) the 
sum of the points (#1,2/1) and (# 2 , 2 / 2 ) P oint ( x i + #2, 2 /i + 2/2 + 2#i# 2 ). 

b) For the hyperbola x 2 - y 2 = 1 with the fixed point E — (1,0) the sum 
of the points (#1,2/1) and (#2,2/2) is the point (#i # 2 + 2/12/2,2/1^2 + 2/2#i)- Under 
the parameterization # = cosh t and y = sinh t this addition corresponds to the 
addition of the parameter t. 

A cubic is a plane algebraic curve Yli. j a ij x ' l y J — 0) where the greatest value 
of i+j is equal to 3. On any nonsingular cubic, there also exists a quite natural 
law of addition of points. (We will discuss in detail what a nonsingular cubic is in 
§1.3.) The law of addition of distinct points of a cubic can be defined as follows. 

On a cubic, fix an arbitrary point E (it will turn out to be the zero element). 
In order to add the points A and B , draw the straight line AB. It intersects the 

1 ..-. 
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Figure 1 

cubic at the point X. The point of intersection of the line XE with the cubic will 
be considered as the sum of A and B (Figure 1). 

In the definition of the addition we used the following property of a cubic twice: 

If a straight line intersects a cubic at two points, then the line intersects the 
cubic at precisely one more point. 

This property seems to be almost obvious. Indeed, solve the equation of the 
straight line ax + by + c = 0 for x or y and substitute their values into the equation 
of the cubic. We get a third degree equation. By the hypothesis, it has two real 
roots and, therefore, there should exist a third real root. 

In reality everything is not that simple. And the problem is not only that the 
polynomial can have multiple roots. The degree of the polynomial can also turn 
out to be lower than 3. In the latter case the operation of addition is degenerate: 
we cannot add any pair of points and this case is for now of no interest to us. We 
will discuss in §1.2 how it is possible to define addition for all points. 

The commutativity of the obtained operation is obvious. It is also easy to 
verify that E is the zero element. The associativity of the operation is, however, 
not obvious. The equality [A + B) + C — A + (B + C) is equivalent to the fact that 
the intersection points of the straight lines connecting the point A + B with C and 
also B + C with A lie on the cubic (Figure 2). 

Denote the straight lines depicted in Figure 2 as follows: 

Pi—AB, P2~E(B + C), ps — C(A + B), 

Qi = BC, q 2 — E(A + B), q 3 = A(B + C). 

Assume that all the intersection points of the straight lines Pi and cp are pairwise 
distinct. Then the statement to be proved can be formulated as follows. 

1.1.1. Theorem. Let Aij be the intersection point of the straight lines Pi and 
qj, where 1 < i,j <3 and the points Aij are pairwise distinct. Suppose that it is 
known that all the points Aij, except, perhaps, A 33 , lie on a cubic. Then A33 also 
lies on this cubic. 
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PROOF. Let Pi(x, y) — 0 and qj ( x , y) = 0 be the equations of the straight lines 
Pi and qj . Then the third degree equation P1P2P3 — 0 determines the triple of lines 
Pi, P‘2 and p 3 and the equation qiqzqs — 0 determines the triple of lines q\ , q-2 and 
qs. The cubic apip 2 Ps + Pqiq'iQz = 0 passes through all the points Aij. 

It turns out that one can represent in this way the equation of any cubic passing 
through eight of the nine points Ay. Let us prove this. 

Choose the straight lines p\ and q\ as coordinate axes, i.e., assume that pi(x,y) 
= y and qi(x,y) = x. Let the given cubic be determined by the equation P(x, y )-= 
0. The functions P{0,y) and ypzi®, y)Ps(0, y) vanish at the three points .An, 
A 21 and A 31 on the y-axis (Figure 3). Moreover, these functions are polynomi¬ 
als of degree not higher than 3. Therefore, P(0,y) = m/p 2 ( 0 , y)ps(0, y). Similarly, 



Figure 3 
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P(x , 0 ) = flxq 2 (x , 0 ) 93 ( 2 ;, 0 ). Consider the polynomial 

Q{x,y ) = P(x,y) - ayp 2 (x,y)p 3 (x,y) - /3xq 2 (x,y)q s (x,y). 

Clearly, 

Q(0,y) = P(0,y) = m/p 2 (0, y)p 3 (0, y) ~ 0. 

The polynomial ao(y) + a\{y)x + a 2 (y)x 2 H-vanishes identically at x — 0 only if 

ao(y) is identically equal to zero, i.e., if this polynomial is divisible by x. 

Similar arguments show that Q(x,y ) is divisible by y as well, i.e., Q(x,y) = 
xyQx(x,y). The degree of Q(x,y ) does not exceed 3, hence, Q\(x,y) is either a 
linear function or a constant. Now, let us recall that the polynomials P, p 2 p 3 and 
929.3 vanish at the points A 22 , A 23 and A 32 and, therefore, the polynomial Q also 
vanishes at these points. Since at all these points xy / 0, the linear function Q\ 
must vanish at them. The points A 22 , A 23 and A 32 do not lie on one line, and 
for a nonzero linear function / the equation f(x,y ) =0 determines a straight line. 
Hence, Q 1 = 0, i.e., P = apip 2 p 3 + Pqiq 2 q 3 . 

In particular, the point A 33 lies on the curve P(x,y ) = 0. We have also proved 
that any cubic passing through the points Aij is given by the equation 

01 P 1 P 2 P 3 +/?9i9293 = 0. 

In other words, such curves constitute a one-parameter family. The proof of Theo¬ 
rem 1.1.1 is now completed and, together with it, the proof of associativity of the 
addition of points on a cubic. □ 

Prom Theorem 1.1.1, one can get a very simple proof of 

Pascal’s theorem. The intersection points of opposite sides of an inscribed 
hexagon lie on one straight line (Figure 4). 


B 



Figure 4 
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Proof. Let p± = AB , q\ — BC , P 2 = EF, q 2 = DE, p$ = CD, q% = AF. As 
a cubic let us take the curve cut out by the equation Ql — 0, where Q = 0 is the 
equation of a circle and l = 0 is the equation of the straight line UV (here U and V 
are the intersection points of the lines p\ with q 2 and P 2 with q \, respectively). Let 
W be the intersection point of the lines ps and q%. Of the remaining intersection 
points of the lines pi and (p it is known that they lie on the curve Ql — 0. The 
point W also lies on this curve and it must then lie on the straight line l, since it 
is not on the circle. 

Instead of the circle Q = 0 one can take any second degree curve. In particular, 
we may assume that Q — pq, where p and q are linear functions. In this case we 
get 

Pappus’s theorem. Assume the points A, C and E on the straight line p 
as well as the points B, D and F on the straight line q are given. The lines AB 
and DE, BC and EF, AF and CD intersect at the points U, V, W, respectively 
(see Figure 5). Then the points U, V and W lie on one straight line. □ 



Figure 5 * 

Sometimes we have to apply Theorem 1.1.1 when several of the points Ap 
coincide. Hence, we have to understand how we should reformulate the theorem 
so that it remains true in such circumstances. In the course of the proof of the 
theorem we twice made use of the possibility to distinguish the points Ap: 

1) the function xy is nonzero at points A 22 , A 23 and A 32 and, therefore, the 
linear function Q 1 vanishes at them; 

2) these points do not lie on one straight line; hence, Q\ = 0. (Hereafter 
the sign = is sometimes used not as a congruence but to express the notion “is 
identically equal to”.) 

During the proof of Theorem 1.1.1 we only made use of the restriction of the 
polynomial P to the lines pi and qj. Hence, we may expect that instead of requiring 
that the points Ap are distinct, it suffices to assume that 

If two (or three) of the points Ap on the line pi or qj coincide, then the re¬ 
striction of the polynomial P to this line has at the point of coincidence a root of 
multiplicity two (or three). 

This modification also concerns the point A33. 

Let us show that the formulation of Theorem 1.1.1 can indeed be modified in the 
way required. The proof of the fact that the polynomial Q = P — ap\P2P3 — fiqiq2 <Zs 
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is divisible by xy — p\qi works without changes. If A, L j — A ^ = A, then at the point 
A the restriction of P to jp has a root of multiplicity 2, the restriction of P1P2P3 
to pi is identically zero and the restriction of qiq 2 Q 3 has a root of multiplicity 2 
because qj(Aij) — 0 and qk{Aik) = 0. Therefore, the restriction of Q to p % has a 
root of multiplicity 2 at A. 

The arguments are similar for the line qj and also in the case of three coinciding 
points. Hence, it is clear that the linear function Qi still vanishes at points A22, 
A23 and A 32 ■ 

If several of these points coincide, we use the fact that no nonzero linear function 
on the straight line can have a root of multiplicity 2 . 

As for the statement of Theorem 1 . 1 . 1 , it is clear that for the restriction of the 
function ap\p 2 Pz + / 3 (?i<? 2<73 to the line pz the multiplicity of the root at the point 
A 33 is equal to the number of lines q 3 passing through the point A 33 . 

For the line l tangent to the curve F(X) = 0 at a point Xq the restriction of 
F to l is of multiplicity 2 at the point Xq . Indeed, let a point X\ move along this 
curve towards the point Xq. The restriction of the function F to the straight line 
XqXi has roots Xq and X\. In the limit position the straight line XqXi coincides 
with l and the roots Xq and X\ merge into one root of multiplicity 2 (Figure 6 (a)). 
The merging of three roots takes place on the tangent to the inflection point (Figure 
6 (b)). In §1.3 we will discuss in detail points of multiple intersection of a straight 
line with a cubic. 


1 


1 

(a) (b) 

Figure 6 

For curves of degree n > 3 Theorem 1.1.1 can be generalized as follows. 

1.1.2. Theorem. Let Aij be the intersection point of the straight lines Pi and 
qj, where 1 <i,j<n; let points A i3 be pairwise distinct. Suppose it is known that 
all points A tJ , where i + j < n + 3, lie on a curve of degree n. Then the other points 
A^ also lie on this curve. 

PROOF. Let us take the straight lines pi and q\ as coordinate axes. Let the 
curve in the formulation of the theorem be given by the equation P n (x,y ) = 0. 
Then P n (0,y) — api ■ ■ ■ p Tl and P n (x. 0) = fiq\ ■ ■ ■ q n . Consider the polynomial 
Qn — P n — otpi • ■ - pn — /3<?i • • • q n . It suffices to demonstrate that Q n = 0. It is 
easy to verify that Q n is divisible by xy — Piqi, i.e., Q n — piqiQ n -2- It remains to 
prove that the nonzero polynomial Q n - 2 of degree not greater than n — 2 cannot 
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Figure 7 

vanish at the points Aij , where i,j > 2 and i + j < n + 3. (We shall prove this 
statement by induction on n.) 

Suppose that such a nonzero polynomial Q n -2 exists. Its restriction to the 
straight line p 2 vanishes at n —1 points A 22 , A 2 s,... , A 2n - Therefore, the restriction 
of Q n -2 to this line is identically zero, i.e., Q n - 2 = PiQn-z- The polynomial Q n - 3 
vanishes at points forming a similar configuration of lesser size (on Figure 7 these 
points are encircled with a thick curve). 

These arguments illustrate the inductive step. The base of induction (n — 3) 
is considered in the proof of Theorem 1.1.1. □ 


* * * 

A method of proving geometric theorems using the family of curves 
(1.1) P 1 P 2 PZ + Mi©® = 0 

was developed by German mathematician Julius Pliicker (1801-1868). The idea 
to represent a triple of lines as a degenerate cubic turned out to be quite fruitful. 
This representation allowed one to reduce the proof of various complicated geomet¬ 
ric theorems to an ingenious selection of the coefficient /r in (1.1); this j.i started to 
appear often in Pliicker’s papers. 

Such an algebraization of geometry did not appeal to everybody. Jacob 
Steiner (1796-1863) — one of the most prominent geometers of that time — 
even flatly refused to ascribe signs to geometric quantities and preferred to con¬ 
sider instead distinct variants of the points’ positions. In spite of the complicated 
way of treating the subject that Steiner chose, he succeeded several times in getting 
finer and deeper results than Pliicker. Steiner referred unfavorably to new algebraic 
methods in geometry. 


Problems 

1.1.1. The straight lines AB and CD intersect at the point P, the straight lines 
BC and AD intersect at the point Q. A cubic passes through the points A. B, C. 
D , P, Q. Prove that the tangents to the cubic at the points P and Q intersect at 
a point that lies on the cubic. 
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Hint. Apply Theorem 1.1.1 in the case when A 31 = A 32 and A 13 = A 23 . 

1.1.2. A straight line intersects a cubic at the points A , B and C. The tangents 
to the cubic at the points A, B and C intersect the cubic at the points A\, B\ and 
Ci. Prove that the points A 1 , Bi and C x lie on one line. 

Hint. Apply Theorem 1.1.1 in the case when pi = P 2 . 

1.1.3. An octagon with sides li, ... , 1% is inscribed in a conic. Prove that the 
eight intersection points of the lines k and lj, where j - i = 3 (mod 8), lie on one 
conic. 

Hint. Let Pi — hi-i, q% ~ hi, C\ = 0 be the initial conic, and C 2 = 0 be the 
conic passing through 5 of the 8 remaining intersection points of the straight lines 
Pi and q : j. Apply Theorem 1.1.2 to the curve C 1 C 2 — 0. 

1.1.4. Let the intersection points of the straight lines pi = 0, ... , p n = 0 be 
distinct. Prove that the equation of any curve of degree n — 1 passing through all 
intersection points is of the form 

Pi-•••£« f—+ ••■ + —) =0, 

\Pl Pn) 

where Ai, ... , X n are certain constants. 

Hint. Let C — 0 be an equation of such a curve. Consider a straight line l 
not passing through the intersection points of the straight lines pi. It is possible to 
select numbers A i so that at all n intersection points of l with the lines pi we have 

c-Pl---Pn (— + ••• + —) — 0. 

\Pl Pn J 

The same equality holds then at n points of any of the lines p. L . 

§1.2. Lines and curves on the projective plane 

In the preceding section we wrote that the addition of points on a cubic is 
defined, generally, not for all points. Let us illustrate this with an example of the 
curve 

(2.1) y 2 -x(x-l)(x- 2 ) 

plotted in Figure 8. Substituting the equation of the line x = \ into (2.1) we get 
y 2 — The degree of this equation is equal to 2, not 3. Hence, the line x = \ 
intersects the curve (2.1) at two points only, and the intersection points are not 
multiple ones. An attempt to add these points will not be successful. 

If (x, y) is a point on the curve (2.1), then 

lim — = lim —)L= = lim -4= = 0. 

ie —>00 y x-^00 W x 3 x—*oo y/x 

A suspicion arises that both the curve (2.1) and the line x — | pass through an 
infinite point in the direction of the y-axis. The lacking intersection point may turn 
out to be situated on the line at infinity. 

Let us try to augment the collection of the points of the ordinary plane with 
points at infinity, thinking about these points as intersection points of parallel 
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Figure 8 

lines. Another reason for doing this is that otherwise even our formulations and 
proofs of Pappus’s and Pascal’s theorems would be inaccurate. Indeed, we have 
always assumed so far that the lines under consideration do indeed intersect. But 
they might be parallel as well. We can certainly consider, separately, the cases 
when certain lines intersect and certain lines are parallel, but this is quite tiresome 
because every case requires not only a separate formulation but a separate proof. 

Put the plane 7 r in a three-dimensional space and consider a point O outside the 
plane tt. With every point A G tt associate the straight line OA. To the line l E tt 
this correspondence assigns not the whole plane 01 , i.e., the plane that contains the 
point O and the line l, but the part of 01 without the line l' that passes through O 
parallel to l. If the line li € 7 r is parallel to the line l, then the planes 01 and Ol\ 
intersect along a straight line, l'. It is also clear that if the point A runs along the 
line l to infinity, then the limit position of the line OA is the straight line V. 

Define the real projective plane MP 2 as follows. The points of MP 2 are the lines 
passing through O. Let the lines in MP 2 be the planes passing through O. In this 
picture the lines parallel to a plane tt correspond to the infinite points of tt and the 
plane parallel to tt corresponds to the line at infinity in tt. On the projective plane, 
any two straight lines intersect at one point. The projective lines corresponding to 
parallel lines in tt intersect at a point of the line at infinity. When we forget about 
tt, the infinite points do not differ from the other points. 

To deal with algebraic curves we have to introduce coordinates on the projective 
plane. Assume that the point O is the origin of the coordinate system in a three- 
dimensional space and the plane tt is given by the equation z — 1. A line passing 
through O consists of the points of the form (Xx,Xy,Xz), where x, y, z are fixed 
and A runs over M. Therefore, we may consider the nonzero triples of real numbers 
(, x,y,z ) as points of MP 2 ; here the triples ( x,y,z ) and (Xx,Xy,Xz), X 7 ^ 0, are 
considered to be equivalent, and MP 2 is the quotient of the set of triples modulo 
this equivalence relation. The line at infinity is given by the equation z — 0. 

In the definition of the projective plane x,y, z and A can be taken to be complex 
numbers. In this way we get a definition of the complex projective plane, CP 2 . The 
geometry of algebraic curves in CP 2 is considerably simpler than that in MP 2 . 
This phenomenon is related to the fact that over C every nth degree polynomial 
has precisely n roots (multiplicities counted). 
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To the curve 

(2.2) y 2 = x(x — l)(x — 2) 
we can assign the curve 

(2.3) y 2 z = x(x - z)(x — 2 z) 

on the projective plane. Indeed, equation (2.3) actually defines a curve on the 
projective plane because the points ( x,y,z ) and (Xx,Xy, Xz) either simultaneously 
satisfy (2.3) or not. Moreover, on the plane tv, given by the equation z — 1, both 
equations, (2.2) and (2.3), coincide. 

Similarly, to any algebraic curve a lJ x' l 'y J = 0 we can assign the curve 
Q>ijX l y :i z n ~ l ~i — 0, where n — max(i + j), 

on the projective plane. Now we can verify our hypothesis that the line x — \z 
and the curve y 2 z = x(x ~ z)(x — 2z) meet at the infinite point in the direction 
of the y- axis. Substituting the expression x — § into the equation of the curve we 

get y 2 z ~ -§-. This equation has three types of solutions, namely, (1) z — 0, y is 
arbitrary; (2) y = kz and (3) y — —kz, where k = ^3/8 and a: is arbitrary. In 
other words, each time we get a family of solutions, each family corresponding to 
one point of CP 2 . 

Therefore, the line x = ~z on the projective plane does, indeed, intersect the 

curve considered at the three points: (|, y'f , 1)» (|>"\/i> 1 )) (0,1,0). The third 
point is the infinite one in the direction of the i/-axis. 

It is even possible to draw a sketch of what the line x = ~z and the curve 
considered look like in a neighborhood of the infinite point (0,1,0). To this end, 
instead of the plane z — 1 one should take a plane passing through the point (0,1,0) 
and not passing through the origin. Take, for example, the plane y — 1. On it, we 
get the curve z — x(x — z)(x — 2 z). For small x and 2 our curve looks almost like 
the curve z = x 3 (Figure 9). 



Figure 9 

The passage to the projective plane is helpful not only in the case considered 
above. Let us show, for example, that any line on the projective plane either 
entirely belongs to the cubic or intersects it (multiplicities counted) at precisely 
three complex points; if we consider real points only, then it intersects the cubic at 
either one or three points. 
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We can find the intersection points of the straight line ax + by + cz = 0 and 
the cubic J2i+j+k =3 — 0 on the projective plane as follows. One of the 

numbers a, b, c is nonzero. Let, for instance, c ^ 0. Then z = ax + fiy, where 
a — —a/c and /? = — b/c (the case a — /3 = 0 is not excluded). Inserting this 
expression into the equation of the cubic we get an equation of the form Q — 0, 
where Q(x,y ) = b v x p y 3 ~ p . The following two cases are possible: 

1) All the coefficients b p are zero. Then the line ax + by + cz — 0 is entirely- 
contained in the curve, i.e., Q is divisible by ax + by + cz. 

2) Not all the coefficients b p are zero. Then 

Q{x, y) = bx r y s (x - t x y) • • • {x - t m y), 

where r + s + m = 3. To the factor x r there corresponds the intersection point 
(0,1,/?) of multiplicity r; to the factor y s there corresponds the point (1,0, a) of 
multiplicity s; and to the factor x — Uy there corresponds the point (i;, 1, at. t + (3). 

A cubic polynomial Q with real coefficients can have either three real roots or 
one. Therefore, any line on the projective plane intersects a cubic either at three 
real points or at one point (multiplicities counted). Therefore, we have almost 
managed to clarify how to add noncoinciding points on a cubic. 

We only have difficulties with points of self-intersection or with cusp points. 
The problem is that any line passing through such a point has a multiple intersection 
with the curve (cf. Problems 1.2.2 and 1.2.3). Therefore, taking the sum of such a 
point with any other point we never get any new points. We will discuss singular 
cubics in more detail in §1.5. 

Now, it only remains to investigate the addition operation for coinciding points 
and figure out what the geometric meaning of the multiplicity of the intersection 
point is. We will do this in the following section. 


Problems 

1.2.1. Prove that the curve y 2 — x 3 +px + q intersects the infinite line z = 0 
at one point and the multiplicity of the intersection is equal to 3. 

1.2.2. Prove that at the point (0,0) any line intersects the curve y 2 = x 2 {x + 1) 
with multiplicity not less than 2 and for the lines y — ±x the multiplicity is equal 
to 3. 

1.2.3. Prove that at the point (0,0) any line intersects the curve y 2 — x 3 with 
multiplicity not less than 2 and for the line y — 0 the multiplicity is equal to 3. 

1.2.4. Prove that on the complex projective plane any circle (x—a) 2 + (y—b) 2 — 
R 2 passes through the infinite points (1, z, 0) and (1, —z, 0). 

§1.3. The tangents and inflection points 

To add points A and B on a cubic we have to draw the line AB. How should 
one act if the points A and B coincide? Let us assume that the point A is fixed and 
the point B moves towards A along the given curve. Then, under certain conditions 
(point A should be nonsingular), the line AB tends to a fixed line, the tangent at 
A. Therefore, to find the sum A + A, instead of the line AB, we should draw the 
tangent at A (provided the tangent is uniquely defined at this point). 

If the curve passing through points A and B is given by an equation F — 0, then 
the restriction of F to AB has roots at points A and B. In the limit position, when 
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points A and B coincide, the restriction of F to A has a multiple root. Therefore, 
the restriction of F to the tangent has a multiple root at the tangent point. This 
property can be used in order to get the equation of the tangent. 

Let the point P — (pi,p 2 ,P 3 ) belong to the curve F = 0, i.e., F(P) = 0, and 
let X = {xi,X 2 ,Xz) be an arbitrary point. The points of the projective line PX 
are of the form A P + pX. The points of this line distinct from X are of the form 
P + tX. Let us consider the restriction of F to the line PX as a function of t. In 
the case of interest to us F is a polynomial of degree 3, hence, 

F(P + tX) = F(P) + at + bt 2 + ct 3 = Q(t), 

where F(P) — 0, a = '%2F'i{P) x i, and b = | YjPij{P) x i x j (here Fi is the partial 
derivative of F with respect to the ith variable). The point P corresponds to 
the value t — 0. The polynomial Q(t) has a multiple root at zero if a = 0, i.e., 
EFi(P)xi = 0. 

A point P for which at least one of the numbers Fi(P ) is nonzero is called a 
nonsingular point of the curve. For a nonsingular point P the equation F) (P)xi = 
0 uniquely determines the line l tangent to the curve at P. 

The tangent to the curve is geometrically uniquely defined regardless of the 
coordinate system. In our approach it is not yet clear that the definition of the tan¬ 
gent and the singularity of a point does not depend on the choice of the coordinate 
system. Let us prove the invariance of these definitions. 

Now we show what happens under the change of coordinates (aq, £ 2 , 2 : 3 ) *-*■ 
(tti, U 2 ,u 3 ), where aq = Yjj a ij u j- Let G{u\, tt 2 , U3) — P(xi(u), x 2 (u),x 3 (u)). Then 

n __ dG _ dF dxi _ ^—\ ^ 

j ~ duj ~^dx i frT j ~ ^ iaij ' 

Since the matrix J = (a. LJ ) is nonsingular, the triple (Gh,Gb, G 3 ) is nonzero if 
and only if the triple (Fi,F 2 ,F 3 ) is nonzero. If / and g are rows (Fi,F 2 ,F 3 ) and 
{Gi,G 2 ,G 3 ), and x and u are columns {xi,x 2 ,x 3 ) t and {ui,U 2 ,u 3 ) t , respectively, 
then x ~ Ju and g — fj. Hence gu = ( fJ)(J~ 1 x ) = fx and the equations fx = 0 
and gu — 0 determine the same line. 

To pass from the projective coordinates ( 3 : 1 , 312 , 3 : 3 ) to the Cartesian ones 
i x h x i) we have to set x 3 — 1. Assume pi = 1. To satisfy the condition x 3 = 1 
for a point of the line P, we have to express the points of the line PX in the form 
P + t(X — P). The expansion 

F(P + t(X - P)) = ^Fi(P)(xi - pi)t H- 

allows us to express the equation of the tangent in the form 
Fi{P)xi + F 2 (P)x 2 = F l {P) Pl + F 2 {P)p 2 . 

In the projective coordinates, i.e., for a homogeneous function F, the expression 
Y^Pi{P)Pi is equal to zero. The point is that for any homogeneous polynomial F 
of degree n the Euler formula 

Y J Fi{X)x i =nF(X) 

holds. Consider a monomial M = x^x^x^ 3 , where P{ = n. Clearly, for the 
nonnegative pi we have Xidi(M) — PiM. It remains to recall that p + q + r — n. 
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Let P be a nonsingular point on the curve F = 0. Then the tangent l at the 
point P is defined. The restriction of F to l has a multiple root at P. If the multi¬ 
plicity of this root is not less than 3, then P is called an inflection point. In other 
words, the condition a = X) Fi(P)xi — 0 must imply that b — \ J2 F tj (P)x t Xj = 0, 
i.e., the quadric X F tJ (P)x t Xj = 0 should contain the line )T F l (P)x i = 0. 

Recall that the second degree polynomial x T Ax (expressed here in the matrix 
form) is divisible by the linear function x T l only if x T Ax — x T lm T x for some to. 
This means that the matrix A = lm T is the product of a column by a row, i.e., 
is of rank 1. In particular, det .A = 0. Thus, if P is an inflection point, then 
det (Fij(P)) = 0. 

Let us show that for a nonsingular point on the curve the converse is also true, 
i.e., if P is a nonsingular point and det (Fij(P)) = 0, then P is an inflection point. 
Let us consider the quadric J^F i j(P)x i x j = 0. The point P belongs to it since by 
the Euler formula 

Fij (P)ViPi = 2 ]TFj ( p )Vi = 6 A A = 0. 

Moreover, the line J2i Fi(P)xi = 0 is the tangent to this quadric at P. Indeed, the 
equation of the tangent to the quadric Fij(P)xiXj = 0 at P is of the form 


yi Fij ( P)%iPj — 0 


and by the Euler formula YLi,j Fij(P) x iPj = 2 Xa Fi{P) x i■ We did not yet use the 
degeneracy of the quadric; in any case the tangent to the curve at P is at the same 
time the tangent to the quadric • Fij(P)xiXj = 0. But in the case when this 
quadric consists of a pair of lines it entirely contains the tangent. 

Let us summarize. The set of intersection points of the curves F = 0 and 
H = 0, where H(X) — det (Fij(X)), contains all the inflection points of the curve 
F = 0 (among these intersection points only singular points of this curve will not 
be inflection points). The curve H = 0 is called the Hesse curve or the Hessian 
of the curve F — 0. If F is a homogeneous polynomial of degree n, then 
is a homogeneous polynomial of degree n — 2. Therefore, H is a homogeneous 
polynomial of degree 3(n — 2). For a cubic polynomial F the polynomial H is also 
a cubic one. 

The invariance of the notion of the inflection point and of the Hesse curve can 
be proved almost in the same way as we proved the invariance of the tangent. 

Let G{ui,U2,us) = F(xi(u),X2{u),x 3 (u)), where x t = J2 a ij u j- Then 



d 2 G 

dUpdu q 


d 2 F dxi dxj _ y "' jp 

-L " F aip ' ijaiq 

lij ' l >3 


i.e., {G pq ) = J T {Fij)J, where J = (a %3 ). Therefore, det{G pq ) = (det J) 2 det (Fij). 
Hence, the conditions det(F^) — 0 and det(G pg ) = 0 are equivalent. 

The search for the inflection points of the curve reduces to the search for the 
intersection points of the curve with the Hessian. So we have to find the intersection 
points of the two curves. We have already done this in the case when one of the 
curves is a straight line. The equation of the straight line enables us to express one 
variable in terms of the other one. Substituting this expression into the equation 
of the curve we can exclude one of the variables. For the curves of arbitrary degree 
we can also exclude one variable, but it is more difficult to do. To make the 
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representation more visual, we will first consider curves in the Cartesian coordinates 
(x,y) and only afterwards pass to the projective coordinates ( x,y,z ). 

For simplicity, let us confine ourselves to the third degree curves. It is possible 
to express the third degree polynomials F(x,y ) and H(x,y ) in the form 

F(x,y) - a 0 y 3 + at(x)y 2 + a 2 (x)y + a 3 (x), 

H{x,y) = b 0 y 3 + h(x)y 2 + b 2 (x)y + b 3 {x), 

where ak(x) and bk{x) are polynomials of degree not greater than k, 0 < k < 3. 
If (xo,yo) is a common point of the curves F(x,y) — 0 and H(x,y) = 0, then 
polynomials f(y) — aoy 3 + a\y 2 + a 2 y + a 3 and h(y) — boy 3 + b\y 2 + b 2 y + b 3 , where 
dk = a,k(x) and bk — bk{x), have a common root yo) the converse is also true: if the 
polynomials have a common root yo, then the curves have a common point (xo,yo). 

Over C, two polynomials have a common root if and only if they have a common 
nonconstant divisor (over M the common divisor may have no roots). If aobo ^ 0, 
then the polynomials f(y) and h(y) have a common divisor if and only if there exist 
polynomials hi and fi such that fh± — hfi , where the degrees of hi and fi are 
lower than the degrees of H(x,y ) and F(x,y), respectively. 

Indeed, if / and h have a common divisor d, then we may set fi = fd~ x and 
hi — hdr 1 . If fhi — hfi and deg/i < deg/, then all prime divisors of / should 
occur in the prime factorization of hfi ; moreover, they enter with the same degrees. 
On the other hand, not all of them occur in the factorization of fi. 

The restriction aobo /- 0 is, no doubt, bothersome but in the projective case it 
is easy to satisfy. 

Let hi(y) — uoy 2 +uiy+u 2 and fi(y) — v 0 y 2 +viy+v 2 . The equality //ii = hfi 
can be expressed in the form 


aoUo 



-b 0 v o 


= 0, 

aiu 0 



-Mo 

-Mi 

= 0 , 

a 2 uo 

+aiui 

+a 0 u 2 

-Mo 

-Mi 

-M 2 = o, 

a 3 uo 

+a 2 ui 

+aiu 2 

-Mo 

-Mi 

-M 2 = 0, 


a 3 ui 

+a 2 u 2 


-Mi 

-M 2 = 0, 



a 3 u 2 



-M2 = 0. 


This system of linear homogeneous equations with respect to u and v has a nonzero 
solution if and only if its determinant vanishes, i.e., 


a 0 

a t 

a 2 

«3 




ao 

at 


a 3 




ao 

ai 

a 2 

a 3 

bo 

bi 

b 2 

b 3 




bo 

bi 

b 2 

bs 




bo 

bi 

b 2 

b 3 


This determinant is called the resultant of the polynomials / and h. The coefficients 
Ofc and bk depend on x and, therefore, the determinant (3.1) is a polynomial R of 
x , perhaps the zero one. For every root Xq of the polynomial R(x) the curves 
F(x,y) — 0 and H(x,y ) = 0 have a common point {xo,yo). (Observe that in the 
real case the fact that xo £ R does not necessarily imply that yo € E.) If the 
polynomial R(x) is the zero one, then the curves have a common component. 
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Now, let us repeat the above arguments for the projective coordinates. Ex¬ 
pressing the polynomials F and H in the form 

F(x, y, z) — a 0 z 3 + a x (x, y)z 2 + a 2 (x, y) + a 3 (x, y), 

H(x, y, z) = b 0 z 3 + b!(x,y)z 2 + b 2 (x,y) + b 3 (x,y), 


where a*, and bk are homogeneous polynomials of degree k for 0 < k < 3, it is 
possible to choose coordinates so that the curves F = 0 and H = 0 do not pass 
through the point (0,0,1). Then the condition we need, a 0 b 0 0, will be satisfied. 

In the projective case the determinant (3.1) is a polynomial in two variables, 
R{x,y). Let us prove that R is either the zero polynomial or a homogeneous 
polynomial of degree 9 (for curves of degrees m and n the degree of R(x, y) is equal 
to mn). Indeed, 


R(Xx, Xy) 


ao Xai 

X 2 a 2 

X 3 a 3 



a 0 

Xai 

X 2 a 2 

A 3 a 3 



ao 

Xai 

A 2 a 2 

X 3 a ; 

bo Xbi 

X 2 b 2 

A 3 63 



bo 

A&i 

X 2 b 2 

A 3 63 



bo 

A 61 

A 2 b 2 

A 3 fo; 


Let us multiply the second and fifth rows by A and the third and sixth by A 2 . As 
a result, we get a matrix for R(x,y) in which the fcth column is multiplied by A fc . 
Therefore, X 6 R{Xx, Xy) = A 15 R(x, y), i.e., R(Xx, Xy) — X 9 R(x, y). 

In the general case 


X p+q R{Xx , Ay) = X r R{x,y), 

where p~ l + 2 + -- - + (n — 1) — r = 1H-h (m + n— 1) = ( m+n )(g t + n ~ 1) 

j ra(m—1) 

and q = — ^— 1 . 

It is easy to verify that r —p — q — mn. 

9 ; 

The nonzero polynomial R(x , y) can be represented in the form ]Q (y-ix - Xiy ), 

i=l 

where aq and yi do not vanish simultaneously. For every one of the nine pairs 
(; Xi,yi ) there exists z { such that (, x it y it Zi ) is the intersection point of the curves 
/ = 0 and h = 0. The polynomial R(x, y) can have multiple roots, i.e., certain pairs 
( ; Xi,yi ) can be proportional. Therefore, not every pair of cubics has nine distinct 
common points. But in the complex projective plane every two cubics have at least 
one common point. Hence, 

any nonsingular cubic has an inflection point 
{nine inflection points, multiplicities counted). 

This is precisely the property we will need in the next section. 


Problems 

1.3.1. Prove that a point (aro,yo) on the curve y - f{x) is an inflection point if 
and only if f"{x o) = 0 . 

1.3.2. Prove that all points on the curve y 2 = (x - xi)(x —x 2 )(x - x 3 ) are 
nonsingular if and only if the numbers aq are distinct. 
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1.3.3. Prove that on the curve y = (x — xi)(x — X 2 )(x — x 3 ) all points except 
( 0 , 1 , 0 ) are nonsingular. 

1.3.4. Let A and B be inflection points on a cubic, and C the third intersection 
point of the straight line AB with the cubic. Prove that C is an inflection point. 

Hint. Apply Theorem 1.1.1 to the case when p\ = P 2 = P 3 — AB and the 
lines qi , q 2 and q 3 are the tangents at the points A, B and C, respectively. 

1.3.5. The tangents to a cubic at the points A and B intersect at an inflection 
point P and the line AB intersects the curve at C. Prove that PC is a tangent to 
the curve. 

Hint. Apply Theorem 1.1.1 to the case when pi is the tangent at P, p 2 — 
Ps — AB, qi and q 2 are tangents at points A and B, and q 3 = PC. 

§1.4. Normal forms of the nonsingular cubic 

A cubic curve is called nonsingular if all its points are nonsingular. In this 
section we prove that over C the equation of a nonsingular cubic can be reduced 
by linear changes of homogeneous coordinates to any of the following forms: 

1 ) y 2 z = x 3 + pxz 2 + qz 3 (Weierstrass’ form); 

2) x 3 + y 3 + z 3 = 3A xyz. 

In the first case the polynomial x 3 + px + q has no multiple roots (otherwise the 
curve is singular) and in the second case A 3 ^ 1 (otherwise the curve consists of 
three lines). 

Consider a nonsingular cubic z 3 ^' 1 ^^ — 0 over C. In the preceding 

section we have shown that it has an inflection point. We may assume that the 
coordinates of the inflection point are equal to ( 0 , 1 , 0 ) and the tangent to this 
point is given by the equation z = 0. In other words, the restriction of the function 
F(x, Vi z ) — a ij xi y j z 3 ~ l ~i to the line z — 0 (i.e., the polynomial a 30 a : 3 +a 2 ix 2 y+ 
a\ 2 xy 2 + ao 3 ?/ 3 ) has a root x — 0 of multiplicity 3. It follows that <221 = ai 2 = 
003 = 0 but a 30 / 0 , since otherwise the curve considered would have contained 
the whole line z — 0. The tangent at (0,1,0) is given by the equation 

F x { 0,1, 0)x + F y { 0 , 1 ,0 )y + F z ( 0, 1,0 )z = 0. 

Hence, ^(0,1,0) - F y (0,1,0) = 0 but F*(0,1,0) ^ 0, since otherwise the point 
(0,1,0) would have been singular. The value of the homogeneous polynomial 
F z (x , y, z) of degree 2 at (0, 1 , 0 ) is equal to <202 and we may assume that <202 = 1- 
In Cartesian (not projective) coordinates the equation of the curve then takes the 
form 

y 2 ~ 2 {ax + b)y + P 3 (x) = 0, 

where P 3 is a third degree polynomial. Making the change of variables y\ = y—ax—b 
we get 

y\ - (ax + b) 2 + P 3 (x) = 0, 

i.e., y\ = Q 3 (x), where Q 3 (x) = (ax + b) 2 — P 3 (x) is a third degree polynomial. By 
a change of the form x = Aaq + p the polynomial Q 3 can be reduced to the form 
xf + px 1 + q. 

The polynomial Q 3 has no multiple roots, since otherwise the equation of the 
curve could have been reduced to the form y 2 — x 2 (ax + j3) and for such a curve 
the origin is a singular point. 
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In the preceding section we proved that any cubic has 9 inflection points, multi¬ 
plicities counted, but we could not determine whether or not all of them are distinct. 
If the equation of the nonsingular cubic is expressed in the form y 2 = Qs(x), then 
it is easy to find the intersection points of the cubic with the Hessian and show that 
all of them are distinct. 


1.4.1. Theorem. The nonsingular cubic y 2 = Qz(x) in CP 2 has precisely 9 
distinct inflection points. 


Proof. We may assume that the polynomial Q 3 has a root x — 0, i.e., the 
curve considered is given by the equation f(x,y) — 0, where f(x,y) — y 2 - x 3 - 
ax 2 — bx. Since the polynomial Q 3 has no multiple roots, it follows that 6^0 and 
a 2 - 4b ^ 0. To get an equation of the Hessian, pass to homogeneous coordinates: 
F(x,y, z) — y 2 z — x 3 — ax 2 z — bxz 2 . Then 


H(x,y,z) - 


— 6 x — 2 az 
0 

—2 ax — 2 bz 


0 — 2 ax — 2 bz 

2 z 2 y 
2 y —2 bx 


= 8[(y 2 + bxz)(3x + az) - (ax + bz) 2 z], 


i.e., (dividing by 8) we have 

h(x,y ) = y 2 (3x + a) + bx(3x + a) — (ax + b) 2 . 

It is easy to find the intersection points of the curves / = 0 and h = 0. Let us 
express the equation / = 0 in the form y 2 — x 3 + ax 2 + bx and substitute this 
expression into the equation h = 0. As a result we get 

(. x 8 + ax 2 + bx)(3x + a) + bx(3x + a) — (ax + b) 2 = 0, 


i.e., 


q(x ) = 3a? 4 + 4 ax 3 + 6 bx 2 — b 2 = 0. 

Let us prove that the polynomial q(x) has no multiple roots. Its derivative is equal 
to 12(a; 3 + ax 2 + bx). Therefore, 


q{x) - -^ 7 ™ ^3a: + a - — (4b - a 2 )x 2 . 

Suppose that < 7 ( 2 : 0 ) = q'(xo) — 0. Then xq 7 ^ 0, since g(0) = — b 2 ^ 0. On the other 
hand, (4b - a 2 )xQ — 0, where 4b - a 2 ^ 0. Hence, £0 — 0- Contradiction. 

We have proved that the polynomial q(x) has four distinct roots Xi- To every 
root Xi there correspond two distinct values of y because y 2 = xj + ax 2 + bxi = 
2-^ 7 ^ 0. Thus, the curves F = 0 and H = 0 have 8 intersection points in the 
finite domain z 7 ^ 0. Since F(x,y, 0) = —x 3 and H(x,y,0) — 24 xy 2 , it follows that 
on the infinite line z = 0 the curves F = 0 and H — 0 have precisely one common 
point, (0,1,0). The proof of Theorem 1.4.1 is completed. □ 


Any straight line passing through two inflection points contains one more in¬ 
flection point. Indeed, we may assume that the coordinates of one of the inflection 
points are (0,1,0). If (x 0 ,y 0 ) is a common point of the curve y 2 = x 3 + ax 2 + bx 
and its Hessian y 2 (3x + a)+bx(3x + a) = (ax + b) 2 , then (a?o, ~Vo) is also a common 
point. The points (0,1,0) and (xq, Fyo, 1) lie on the line x = xqZ. 
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Figure 10 


Schematically the configuration of the nine inflection points and the twelve 
straight lines that host them is depicted in Figure 10(a). A more symmetric scheme 
of this configuration is shown in Figure 10 (b). 

Recall one of the important notions of the projective geometry. Four points 
in CP 2 are called generic points if no three of them lie on one straight line; four 
straight lines in CP 2 are called generic lines if no three of them meet at one point. 

Points with homogeneous coordinates ( Xi,yi,Zi ) = e*, where i = 1 , 2 , 3 , 
4, are generic if and only if the vectors ei, e 2 , e 3 are linearly independent and 
e 4 = Aiei + A 2 e 2 + Ase 3 , where A 1 A 2 A 3 ^ 0. It is not difficult to show that 
for any 4-tuple of generic points in CP 2 there exists a projective transformation 
(i.e., a linear transformation of homogeneous coordinates ) that maps this 4-tuple 
into any given 4-tuple of generic points. 

Indeed, let {e ^}| =1 and be two 4-tuples of generic points. Then e 4 = 

Aiei + A 2 e 2 + A 3 e 3 and £4 — nie\ +jU 2 £2 + ^3£3, where A 1 A 2 A 3 7^ 0 and ¥" 0- 

The projective transformation required is as follows: 


ei OiiSi, 


where a, = 


— and i = 1 , 2 ,3. 


There is a one-to-one correspondence between the sets of points and the set of 
straight lines in CP 2 : to the point (a, b, c ) there corresponds the line ax+by+cz — 0 
(.projective duality). If the points A and B lie on the line l. then the lines a and b 
dual to A and B meet at the point L dual to the line l. 

To the line ax+by+cz — 0 there corresponds the point (a, b, c ) = a. Therefore, 
if CP 2 is subjected to the projective transformation (x, y, z) i-» (x, y, z)A, where A 
is a nonsingular matrix, then the straight line (x, y, z)a T — 0 turns into the straight 
line (x,y, z)Aa T = 0, i.e., (x,y, z)(aA T ) T = 0. Therefore, the transformation law 
of the coordinates on the line is a 1 —> aA 1 . This transformation is also a projective 
one. Hence, the projective duality makes it possible to prove that any 4-tuple of 
generic lines can be turned into any other 4-tuple of generic lines by a projective 
transformation. 




§1.4. NORMAL FORMS OF THE NONSINGULAR CUBIC 
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1.4.2. Theorem. By a change of coordinates the 9 inflection points of a cubic 
curve can be transformed into the following set of 9 points : 

(0,1, —1) (0,e 2 , —e) (0,£,-£ 2 ) 

(- 1 , 0 , 1 ) ( _ £ 2 , 0 , 1 ) (-£, 0 , 1 ) 

(1, —1,0) (—£,1,0) (-£ 2 ,1,0), 

where e 3 — 1 and e ^ 1, i.e., £ 2 + £ + 1 = 0. 

Proof. The straight lines 189, 463, 527 (Figure 10(b)) cannot intersect at one 
point. Indeed, let R be a common point of these lines. Any 4-tuple of generic 
points in CP 2 can be transformed by a projective transformation into any other 
four generic points. Therefore, we may assume that the points R, 1,5 and 6 are 
real ones. Then all the other points of the configuration are also real. It is easy to 
see that this is impossible. 

Adding the line 145 to the indicated triple of lines we get a 4-tuple of generic 
lines. Therefore, we may assume that the lines 145, 189, 463 and 527 are given 
by the equations x + y + z — 0, x — 0, y — 0 and z = 0, respectively. Then the 
coordinates of the inflection points are of the following form: 

(0,1, —1) (0, a, —b) (0, c, —d) 

(4.1) (-1,0.1) (-a',0,1) (-c',0,1) 

(1,-1,0) (-(>',1,0) (-d',1,0). 


where all the numbers a, b, ... , d' are nonzero. 

The points (0, a, —b ), (—a 7 ,0,1), and (—5', 1,0) lie on one straight line, namely, 
on line 862. Therefore, ab' — ba’ and we may assume that a — a! and b — b'. 
Similarly, c — c' and d — d'. 

Considering the lines 167 and 123 we get a ~ d and b = c, respectively. Con¬ 
sidering the lines 538 and 596 we get a — be and c — ad, respectively. It follows 
that b 3 — 1 and a = b 2 . It is also clear that b f 1. Substituting a = e 2 , b — e, c = e 
and d — £ 2 into (4.1) we get the points required. 

It is easy to verify that the remaining eight lines containing triples of given 
points are given by equations of the form x + ay + /3z = 0, where a and /3 take 
values 1, £ and £ 2 . □ 

With the help of Theorem 1.4.2 it is easy to prove that any nonsingular cubic 
can be reduced to the form x 3 + y 3 + z 3 + 3A xyz + 0. Indeed, let the inflection point 
of the given curve have the coordinates indicated in the formulation of Theorem 
1.4.2. These points belong to both the triple of lines xyz — 0 and the triple of lines 

(x + y + z)(x + ey + s 2 z)(x + e 2 y + ez) — 0. 

Therefore, any cubic passing through these nine points is given by the equation 
yxyz + v(x + y + z)(x + ey + e 2 z)(x + e 2 y + ez) — 0. 

It remains to note that 

(x + y + z)(x + ey + £ 2 ^)(x -f e 2 y + ez) = x 3 + y 3 + z 3 - 3 xyz. 
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Problems 

1.4.1. Prove that there is a linear change x' = ax + b that reduces the curve 

y 2 = {x - X\){x - x 2 ){x - * 3 ) 

to the form 

y 2 = x'{x' — l){x' — A), where A = (#3 — x\) : (x 2 — x\). 

In this way we can get 6 distinct values of A for the same curve, namely, 

A, A -1 , 1 — A, (1-A) -1 , (A — 1)A -1 , A(A-l) -1 . 

1.4.2. Let F — 0 be the equation of a cubic, and H = 0 the equation of its 
Hessian. Prove that A F+jiH — 0 is the equation of a cubic with the same inflection 
points as the initial curve (assuming that the initial curve is nonsingular). 

1.4.3. Prove that the curve x 3 + y 3 + z 3 = 3\xyz is nonsingular if and only if 
A 3 7^ 1. 

1.4.4. Prove that the curve x 3 + y 2 z + axz 2 = 0 is the Hessian of its own 
Hessian. 

1.4.5. Prove that the curve x 3 + y 3 ~b z 3 — 3jixyz is the Hessian of the curve 

x 3 + y 3 + z 3 = 3A xyz if and only if y, = — , 

§1.5. Singular cubics 

The equation of a nonsingular cubic curve can be written in the form 

y 2 = (x- x\){x - x 2 )(x - x 3 ), 

where the numbers aq, x 2 and x 3 are distinct. In the real case such a curve is shown 
in Figure 11. 



Figure 11 


Let x\ < x 2 < x 3 . When the roots X\ and x 2 merge, we get a curve of the form 
y 2 = x 2 (x — 1) (see Figure 12(a)); when the roots x 2 and x 3 merge, we get a curve 
of the form y 2 — x 2 (x + 1) (see Figure 12(b)). Over E these curves are distinct, 
but over C the distinction between them disappears. 

If all the three roots merge we get the curve y 2 = x 3 (see Figure 12(c)). For 
all three curves the origin is a singular point. 

Any straight line y = kx intersects the curves y 2 — x 2 {x ± 1) and y 2 = x 3 
at the singular point with multiplicity at least two. Indeed, for the equations 
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(a) (b) (c) 


Figure 12 

k 2 x 2 = x 2 (x± 1) and k 2 x 2 — x 3 the root x = 0 is at least a double one. Therefore, 
for any straight line connecting the singular point with another point on a cubic 
the third intersection point is the singular point. Therefore, the addition of the 
singular point to any other point should always give the singular point. Hence, we 
cannot define addition for the singular point. But if we exclude the singular point, 
then for either of the curves y 2 = x 2 (x + 1 ) or y 2 = x 3 the addition of points is 
well defined. If for the zero element we take the infinite point in both cases, then 
the curve y 2 = x 2 (x + 1 ) over R. turns into the group of nonzero real numbers 
with respect to multiplication and the curve y 2 — x 3 turns into the group of real 
numbers with respect to addition. 

(Over C we get the group C \ { 0 } with respect to multiplication and C with 
respect to addition, respectively.) 

Let us start with the curve y 2 — x 3 . This curve admits a rational parame¬ 
terization x — t~ 2 , y — t~ 3 . The intersection points of this curve with the line 
ax + by + c — 0 are determined by the relation ct 3 + at + b — 0. If the line does not 
pass through the singular point, then 0. In this case we get a cubic equation 
with the zero coefficient of t 2 . The sum of the roots of such an equation is equal 
to zero: h + t% + t 3 = 0. Let us take for the zero element E the infinite point 
corresponding to the parameter ts — 0. Assume tj± and ts to be the values of 
parameter corresponding to the points A and B of the given curve. The straight 
line AB intersects the cubic at a point A; we have tA + + tx — 0. The line 

EX intersects the curve at the point A + B, i.e., ts + tx + tA+B — 0. Hence, 
tA+B — —tx — tA + ts- Hence to add points on the curve y 2 — x 3 , we must add 
the corresponding values of the parameter t. Observe that to the singular point 
there corresponds the value t = oo of the parameter. 

The curve y 2 = x 2 (x + 1 ) also admits a rational parameterization. Indeed, let 
y = tx. Then t 2 x 2 = x 2 (x + 1), i.e., x — t 2 — 1 and y = tx = t 3 — t. The straight 
line ax 4 - by + c — 0 intersects the curve y 2 = x 2 (x + 1 ) at the point whose value of 
parameter satisfies the relation 

a(t 2 — 1 ) + b(t 3 — t) + c = 0 . 

If b 7 ^ 0, then after the division by b we get a cubic equation with coefficient 1 of t 3 
and — 1 of t. The roots of such an equation satisfy the relation tify+fyis+fyti = — 1. 

A simpler relation can be obtained after the reparameterization 

t=(l + r)(l-r)- 1 . 
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Indeed, it is easy to verify that T 1 T 2 T 3 = 1. For the zero element E take the infinite 
point corresponding to the value of parameter te = 1. To find the sum A + B, 
we must consider the point X at which the straight line AB intersects the cubic. 
Since taTbTx = 1 and TxTeTa+b — 1) it follows that ta+b ~ ta'Tb■ When we 
add points of the curve y 2 = x 2 (x + 1 ), we multiply the corresponding values of 
parameter r. To the singular point there corresponds not one but two values of 
each of the parameters t and r, namely, t = ±1 and r — 0, 00 (see Figure 13). 



Problems 

1.5.1. Prove that the points (tjf 2 ,t]f 3 ), ... , (fg 2 ,£g 3 ) on the curve x 3 = y 2 lie 

on a second degree curve if and only if t\ H- + t$ ~ 0. 

1.5.2. The curve y 2 = x 2 (x — 1) has a rational parameterization x — t 2 + 1, 

y = i 3 At. Prove that the points corresponding to the values of parameter t\, t 2 
and £3 lie on one straight line if and only if t\t 2 + t-it?, + — 1 . 

§1.6. No nonsingular cubic admits a rational parameterization 

The singular cubics that we studied in the preceding section admit a rational 
parameterization. Now we prove that none of the nonsingular cubics admits a 
rational parameterization. Recall that a nonsingular cubic can be reduced to the 
form y 2 = x(x — l)(x — A), where A 7 ^ 0, 1. 

Theorem. If A 7 ^ 0, 1 , then there are no polynomials Pi, P 2 , Q 1 , Q 2 such that 
the nonconstant functions y(t) — P\{t)/P 2 {t) and x(t) = Qi(t)/Q 2 (t) satisfy the 
relation y 2 = x(x — \){x — A). 

Proof. Suppose that P\{t)/P 2 (t) and Qi(t)/Q 2 {t) are not constants and 

P\ _ Qj_ Qi — Q 2 Qi — AQ 2 
P 2 Q 2 q 2 q 2 

Then we may assume that the polynomials Pi and P 2 are relatively prime and so 
are Qi and Q 2 . Since 

PiQl = p%Qi(Qi - Q2XQ1 - A Q 2 ), 

it follows that the polynomial P 2 , which is relatively prime to P 2 , is divisible by 
and the polynomial Q^, which is relatively prime to Q\, Q\ — Q 2 , and Q\ — \Q 2 , 
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is divisible by P 2 2 . Hence, the polynomials Q\ and P 2 2 are proportional to each 
other. Therefore, by replacing P\ with a proportional polynomial we can obtain 
the equality 

(6.1) Pi = Qi{Qi - Q 2 XQ 1 - AQ 2 )- 

Moreover, the polynomial Q\ is the square of a polynomial; hence, Q 2 is also the 
square of a polynomial. 

The polynomials Qi,Qi- Q 2 and Q x - \Q 2 are pairwise relatively prime and, 
therefore, the equality (6.1) implies that each of them is a perfect square. Thus, 
in the family of polynomials of the form aQi + f3Q 2 , where a, (3 6 C, there are 4 
perfect squares, namely, Q x , Q 2 , Qi - Q 2 and Qi - A Q 2 \ these polynomials are not 
proportional and they are distinct because A ^ 0, 1. 

To get a contradiction let us show that on the projective line aQi + (3Q 2 , where 
Qi and Q 2 are relatively prime, not more than three points can be perfect squares. 
Indeed, suppose that on this projective line there are four perfect squares: 

P 2 , P 2 , a x Rl - piRl and ol 2 R\ - f3 2 Rl- 

Since the polynomials Pi and R 2 are relatively prime, it follows that the polyno¬ 
mials ^faiRi ± \ZP 1 R 2 should be perfect squares. As a result, from the projective 
line aQi + [3Q 2 on which there are four perfect squares we come to the projective 
line oPi + (3R 2 on which there also are four perfect squares. From this projective 
line we can come to another projective line, etc.. But each such passage decreases 
the maximal degree of every polynomial of the form aQ 1 + f3Q 2 at least by a factor 
of two. Contradiction. □ 


Problems 

1.6.1. Give an example of relatively prime polynomials Q \ and Q 2 for which 
the polynomials Qi, Q\ + Q 2 and Qi + 2Q 2 are perfect squares. - 



CHAPTER 2 


Elliptic Functions 


The addition of points on the circle is related to its parameterization by the 
functions sine and cosine. Indeed, consider the map / : R —> 5’ 1 given by the 
formula f(t) = (cos t, sin t). This map parameterizes the circle by real numbers in 
such a way that the addition of points on the circle corresponds to the addition of 
real numbers. 

A similar parameterization exists for cubics. It is obtained by means of elliptic 
functions. Under this parameterization the addition of points on a cubic defined in 
Chapter 1 corresponds to the addition of the values of the parameter. 

In this chapter we will study the main properties of elliptic functions and show 
how one can parameterize a nonsingular cubic with their help. 

The name elliptic functions stems from ellipse , but the relation is rather indi¬ 
rect. It is elliptic integrals that are in direct relation to the ellipse. The length of 
an arc of the ellipse is expressed by an elliptic integral of a particular form. This is 
precisely where the name elliptic integrals stems from. Elliptic functions appeared 
in the process of inversion of elliptic integrals of another particular form not related 
to the computation of the arc length of an ellipse. 

Elliptic integrals appeared as early as the seventeenth century in calculations of 
arc lengths of certain curves, primarily ellipses. Apart from the ellipse an interesting 
example is Bernoulli’s lemniscate whose arc length is given by an integral of the 
form Jq x /f x x 4 • If was fo r this integral that the Italian mathematician Count 
Fagnano obtained in the first half of the eighteenth century the addition theorem 

j' a dx ± [ 13 d x _ d x 

Jo y/l - X 4 Jo a/I — X 4 Jo Vl ~ X 4 ’ 


where 

a^l — P 4 + (3\/l — a 4 
^ 1 + a 2 0 2 

Euler was interested in Fagnano’s studies. Euler managed to obtain the addi¬ 
tion theorem for integrals of an even more general form, 



where P(x) = 1 + mx 2 + nx 4 . 


Namely, he proved that 
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where 

av'Pp) + /?yrw 

1 — na 2 /? 2 

Euler also obtained addition theorems of an even more general form. 

After Euler, Legendre tirelessly worked for many years on the development 
of the theory of elliptic integrals. He summarized the results of his studies in 
the book Exercises de calcul integral (Exercises on Integral Calculus ), published in 
1811-1819. A revised edition of this book was issued in 1827-1832 under the name 
Traite des fonctions elliptiques et des integrates euleriennes (Treatise on Elliptic 
Functions and Euler Integrals). These are three large volumes that contain a vast 
number of theorems on properties of elliptic integrals and their applications. 

Legendre called elliptic functions what are nowadays called elliptic integrals. 
After works of Abel and Jacobi the importance of Legendre’s book dwindled. Abel 
and Jacobi themselves, however, referred to Legendre’s book with great respect, as 
befits it. 

Elliptic function theory proper began with Abel’s work Recherches sur les fonc¬ 
tions elliptiques (Studies on Elliptic Functions ) published in 1827-1828 in Crelle’s 
journal. Abel showed that the inversion of an elliptic integral of the first kind, 



dx 

a /(1 — cx 2 )( 1 + ex 2 ) ’ 


gives rise to a function (p(a) that has two periods in the complex domain. Abel 
meticulously studied the equations that relate <p(ot) with <p(ria.). Jacobi started to 
study elliptic function theory almost simultaneously with Abel. This led to a tense, 
albeit short, competition between them. Without a permanent position, almost 
in poverty, Abel finished the second part of the Recherches ... and continued his 
intensive studies. But soon he became seriously ill and died in 1829 at the age of 
27. 

Long before Abel and Jacobi, Gauss had known much of what was discovered 
by them. But Gauss did not publish his results. 


* * * 


It is convenient to consider elliptic functions as functions of a complex variable. 
Many of their properties are developed only on the complex plane 1 C, and not on 
the real line E. The parameterization of a cubic curve is also more graphic over 
C. Therefore, we will start with the investigation of the topology of a nonsingular 
cubic in CP 2 . It turns out that from the topological point of view all such curves 
are alike: all of them are two-dimensional tori. 


§2.1. The topological structure of nonsingular cubics in CP 2 

The equation of any nonsingular cubic in CP 2 can be reduced to the form 

(LI) y 2 z — (x - aiz)(x - a 2 z)(x - a 3 z), 


1 In algebraic geometry this set is usually referred to as complex line-, but the authors use the 
term more usual among geometers. Translator. 
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where the numbers are pairwise distinct. This equation determines a complex 
curve in CP 2 whose complex dimension is equal to 1 and whose real dimension is 
equal to 2. 

To find the topological structure of the curve (1.1) in CP 2 , consider the pro¬ 
jection 

V : CP 2 \ {(0,1,0)} —> CP 1 , (x,y,z) (x,z). 

The complex projective line CP 1 (a one-point compactification of C) is homeomor- 
phic to the two-dimensional sphere S 2 . For 6^0 the equation y 2 —b has precisely 
two distinct solutions. Therefore, if z 0 and x—aiz ^ 0, then a point (x, z) € CP 1 
has exactly two preimages that belong to the curve (1.1). If z ^ 0 but x/z is equal 
to one of the numbers a*, then there is only one preimage. For z = 0 the equation 
(1.1) turns into the equation x 3 = 0. Therefore, the point oo = (1,0) also has only 
one preimage, namely, (0,1,0). More exactly, the preimage of the point (1, z) also 
tends to (0,1,0) as z —* 0. 

The projection p of the curve (1.1) on CP 1 is described as follows. If we exclude 
from CP 1 the points a%, a 2 , a 3 , and oo, then all points have exactly two preimages. 
The structure of the map in vicinities of points cq and oo should be studied in more 
detail. For simplicity, assume that ai = 0. Consider affine coordinates, i.e., set 
2 = 1. The projection of the curve (1.1) on CP 1 in this coordinate system can be 
described as (x,y) *—> x. Then (1.1) takes the form 

y 2 = x(x - a 2 )(x - a 3 ), 

where a 2 a 3 7^ 0. For points x close to zero the quantity (x — a 2 )(x — a 3 ) is almost 
a constant, i.e., we have almost the equation y 2 = cx. This equation has solutions 
of the form x = c\ 2 e 2 ' l ' p ,y = cXe l<p . As y> varies from 0 to i r, we perform a full 
revolution around the point (0, 1) on CP 1 . Under such a revolution, y changes 
sign. Lifting the revolution around (0, 1) on CP 1 to the curve (1.1) we do not get 
back the initial point (see Figure 14). But by performing the revolution once again 
we return to the initial point, since by changing the sign of yo twice we get i/q. 


( x o> Vo) 
( x o~Vo) 


The structure of the projection of the curve (1.1) on CP 1 in a vicinity of oo is the 
same as that in the vicinities of the ai. Indeed, let x = 1. Then in a neighborhood 
of 2 = 0 the equation (1.1) looks approximately as y 2 — ~ and, hence, the sign of 
y changes under a full revolution about the point z — 0. 



CZP 


Figure 14 
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Let us cut CP 1 from ai to a2 and from <23 to 00. The liftings of these cuts to 
the curve ( 1 . 1 ) divides it into two parts. Indeed, advancing along any closed path 
in CP 1 that does not intersect the cuts we will circumvent the points <21, <22, <23 
and 00 only in pairs and under the passage around two points the value of y does 
not change. Therefore, it is impossible to get from one preimage of a point of CP 1 
into its other preimage without intersecting the cuts. 



Figure 15 

If we cut CP 1 from a\ to a 2 and from 0,3 to 00, then the remaining part of 
CP 1 can be represented in the form of a plane with cuts, as in Figure 15 . The part 
of the curve ( 1 . 1 ) that lies above this plane consists of two pieces. We only have 
to understand how to glue these pieces. Traversing a cut in CP 1 we go from the 
boundary marked with a plus sign on one piece of the curve (1.1) to the boundary 
marked with a minus sign of the other piece. Hence, when the boundaries are glued, 
we get a torus (Figure 16). 


a 2 a 2 



a 3 a 3 


Figure 16 

A parameterization of a cubic in CP 2 can be determined by means of a map 
/ : C 1 —» CP 2 , where f(z) = (Fi(z), F 2 (z), 1 ). The image of this map should be 
a torus. The simplest map of C 1 to a torus is obtained by the identification of all 
points of the form 2 + nu 1 + mu> 2 ■ In other words, u\ and lo 2 are periods of the 
functions F\ and P2. 
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§2.2. The elliptic functions 

A function / is called doubly periodic if f(z + nu\ + mu^) = f( z ) f° r an y 
m,n 6 Z and some ioj and lu 2 such that loi/u 2 0 E; for definiteness, we assume 
that Im(cui/w 2 ) > 0. This means that the rotation from ccj to u< 2 on the complex 
plane 2 is performed clockwise (Figure 17). 



Figures 17 


In what follows we will only be interested in meromorphic doubly periodic 
functions. Recall that an analytic function is called meromorphic if in the finite 
domain of C it has no singular points other than poles. In a vicinity of any finite 
point a a meromorphic function / can be expanded in the series 

f(z) = c 0 (z - a) r +Ci{z- a) r+1 4-, 

where c 0 0 and r is an integer. A meromorphic doubly periodic function is called 
an elliptic function. 

a>i +io 2 



Figure 18 

Any complex number z can be represented in the form 2 = a^toi 4- a 2 a>2, where 
Oj £ E. The number a, t can be represented in the form of the sum of its integer and 
fractional parts and, therefore, an elliptic function is completely determined by its 
values in the fundamental parallelogram (Figure 18) 

{aicoi 4- <*2^2 | 0 < ai, «2 < !}• 

The image of the fundamental parallelogram under any parallel translation will also 
be called the fundamental parallelogram. 

2.2.1. Theorem. An elliptic function without poles is a constant. 


2 By the complex plane we mean the real plane ffi 2 with a complex structure, i.e., the complex 
plane is a 1-dimensional complex vector space. 
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Proof. Suppose that an elliptic function f(z ) has no poles. Then the function 
\f(z)\ is continuous on C. Since the fundamental parallelogram is compact, \f(z)\ < 
M for a number M. But then \f{z)\ < M for all z E C. Thus, / is a bounded 
analytic function on C. By Liouville’s theorem, / is a constant. □ 

All finite singular points of a meromorphic function are isolated. Hence, the 
fundamental parallelogram contains only a finite number of singular points. There¬ 
fore, there exists a parallel translation of the fundamental parallelogram such that 
there are no singular points on its sides. In what follows we will assume that there 
are no singular points on the sides of the fundamental parallelogram. 

Statement. Let P be the fundamental parallelogram with the vertices a, a + 
0 Ji,a+u>i +uj 2 and a + u> 2 , and dP its boundary (Figure 19). Then f gp f{z)dz = 0 
for any elliptic function f (z). 


CX + LO l + 



Figure 19 

PROOF. Indeed, this integral is the sum of certain expressions among which 
there are, e.g., /“ +Wl f{z)dz and f(z)dz with signs plus and minus, re¬ 

spectively. These integrals are equal, since f(z+U 2 ) — f( z )- Similarly, the integrals 
along the other pair of sides also cancel. □ 

This statement allows us to get essential information on zeros and poles of an 
elliptic function. 

2.2.2. Theorem, a) The sum of the residues of an elliptic function f(z) at 
singular points inside the fundamental parallelogram is equal to zero. 

b) For an elliptic function, let ai be those of its zeros and poles that lie inside 
the fundamental parallelogram, and their orders (positive for zeros and negative 
for poles). Then Yh r i — 0 and JfriOi = 0 (mod A), i.e., Y^ r iOi = mxo\ + 
where m and n are integers. 

PROOF. As we know from complex analysis, if on the boundary of the funda¬ 
mental parallelogram P there are no singular points of a meromorphic function g, 
then 

Y.™<> = hl gP 9(z)dz - 

To prove part a) it suffices to use this identity for g = /. 

To prove that Yl r i — 0 anc ^ Y2 r i a i = 0 (mod A) we set g(z ) = f'(z)/f(z) and 
g (z) = zf(z)/f{z), respectively. 

If f(z) is an elliptic function, then the function g{z) — f'(z)/f(z) is also an 
elliptic function. Moreover, if f(z) = cq(z — a) r + ci(z — a) 7 ' +1 + • • ■, then g(z) = 
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r(z — a) 1 + d\ + d 2 (z — a) + ■ • ■ and, therefore, the residue of g(z) at the point a 
is equal to r. Thus, J2 r i — 0. 

To prove the identity ]T) r i r M — m '^i +nu 2 we must perform certain calculations, 
since the function g(z) = zf'{z)/f{z) is not necessarily an elliptic one. First, 
observe that if f(z) = cq(z — a) r + c\(z — a ) r+1 -, then 


9(z) = 


a+ (z — a) 
z — a 


rc 0 (z -a) r 1 -)- 

c 0 (z - ay- 1 H- 


= ar(z - a) 1 H-; 


hence, the residue of g(z) at the point a is equal to or. Now, let us compute the 
integral J dp g(z)dz. The difference of the integrals 


L 


a+wi 


dz 


and 


/ 


a+ui+u> 2 


a+ o>2 


zf(z) 

f(z) 


dz 


zf'(z) 

m 

(z + U 2 )f(z) 


J 

J OL 


m 


dz 


contributes to this integral. This difference is equal to 


f a+u> i f'( z ) 

-»J jydz = -u, 2 lnf(z)\“+“K 

Since f(a + uq) = f(a), the logarithm of f(z) can change only by 2kiri as z varies 
from o to a + wi- As a result we see that contribution to the integral 


1 

2m 



g(z)dz 


from one pair of sides of the parallelogram is nu> 2 , where n is an integer. Similarly, 
the contribution from the other pair is mux. □ 


As we have already said, a nonconstant elliptic function must have at least one 
pole inside the fundamental parallelogram. But since the sum of the residues at 
the singular points that lie inside the fundamental parallelogram is equal to zero, 
the function cannot have exactly one pole of order 1 there. For an elliptic function, 
the number (multiplicities counted) of poles inside the fundamental parallelogram 
is called the order of the elliptic function. The minimal possible order is thus equal 
to 2 ; and there are two ways for such a possibility to realize itself: 

1) one pole of order 2 , i.e., of multiplicity 2 (this takes place for the Weierstrass 
function that will be discussed in the next section); 

2 ) two simple poles (this takes place for the Jacobi elliptic functions that will 
be discussed in § 2 . 8 ). 

By Theorem 2.2.2 b), for an elliptic function the sum of orders of the zeros 
inside the fundamental parallelogram is equal to the sum of orders of the poles, 
i.e., is equal to the order of the function. It is also clear that the poles of the 
function f(z) — c are the same as those of f(z). Therefore, an elliptic function of 
order r takes any finite value inside the fundamental parallelogram exactly r times 
(multiplicities counted). 
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Problems 

2.2.1. The elliptic functions / and g have the same periods and at every pole 
they have the same principal parts 

c r (z — a) r + c r +\(z — a) r+1 +-b c-\(z — a)~ 1 (here r < 0). 

Prove that the difference of these functions is a constant. 

2.2.2. The elliptic functions / and g have the same periods and their zeros and 
poles (multiplicities counted) coincide. Prove that the ratio of these functions is a 
constant. 

§2.3. The Weierstrass function 

We have already proved certain properties of elliptic functions but we have not 
yet established that there exist nonconstant elliptic functions. It is time to give 
an example of a nontrivial elliptic function. Let us show that for any lattice A the 
function 

(3.1) p(z) = “2 + E 7- Y2 - 2 ’ 

where the sum runs over all nonzero elements u € A, is an elliptic function. (The 
fact that the summation runs over nonzero elements only is denoted by a prime.) 
The grouping of terms in square brackets is essential, since each of the series 
Y'( z ~ u )~ 2 and Y! U} ~ 2 diverges. 

First, let us prove that the series (3.1) does indeed define a meromorphic func¬ 
tion. On any compact set K that does not contain the points of the lattice, this 
series converges uniformly and absolutely. Indeed, 

1 1 _ 2 zlo — z 2 _ u 2 z — z 2 uj~ l 

(z — u ) 2 u 2 u 2 (z~u ) 2 to 4 {zw~ l — l) 2 ’ 

If M is sufficiently large, then (fff-i-i) 2 ~ ^ z - Therefore, for all u £ iV with a 
sufficiently large value of |a>| and for all z € K there exists a constant C such that 

1 1 ^ C 

(z — u) 2 < M 3 ' 

Moreover, \z — to\ > e for all z € K and u € A'; hence, such a constant exists for 
all u) e A' as well. It is easy to verify that the series Y' l^l" 3 converges. Indeed, 

oo oo 

E 1 -r 3 = E E b^i+gw 2 r 3 < E 8n ( n/i ) -3 ’ 

n=l ma x(p,q)=n n=l 

where h — min(|cLi |, |a; 2 |) is the smallest of two sides of the fundamental parallelo¬ 
gram. Thus, p(z) is a meromorphic function with poles at the nodes of the lattice. 
It is called the Weierstrass function. Let us prove the periodicity of p(z). For that, 
let us consider its derivative 

*>'(*) = -2E(*-«r 3 . 

Here the summation runs over all the nodes of the lattice. Clearly, uq and 0 J 2 are 
periods of the function p'(z). Hence, the functions p(z + uf) and p(z) can only 
differ by a constant c. Substituting z = — uq/ 2 into the equality p(z+u>i) — p(z)+c 
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we get p{u)i/ 2) = p(—oij/2) + c. But from formula (3.1) it is clear that the function 
p(z) is even. Hence, c = 0, i.e., uji and o >2 are periods of p(z). 

The function p has double poles at the nodes of the lattice; and it has no other 
singular points. Inside the fundamental parallelogram there is exactly one node of 
the lattice. Therefore, inside the fundamental parallelogram the sum of the poles 
of p is congruent to zero modulo A. By Theorem 2.2.2, inside the fundamental 
parallelogram, there are two zeros of p, call them u and v, such that u + v = 0 
(mod A). For any constant c the poles of the function p(z) — c coincide with the 
poles of the function p(z) and, therefore, inside the fundamental parallelogram there 
are exactly two points, u and v , for which p(u) = p(v) = c and u + v = 0 (mod A). 
If u = — u (mod A), then these two points coincide, i.e., the corresponding value 
of p is attained twice. At the points where the two zeros of p(z) — c merge the 
derivative p'(z ) vanishes. It is possible to select the fundamental parallelogram so 
that it contains exactly four points for which u = —u (mod A), namely, the points 

U)\ . 0?2 j Oil + 0>2 

0 , —, and ——- 

2 2 2 

(Figure 20). The first of these points is the pole of p and the other three points are 
zeros of p'. 



Figure 20 


Thus, the values 

( o>i \ / \ , foj 2 \ 

ei=p[Y)’ e 2 = P ( 2 - 1 and e 3 = P [ y ) 

of p are of multiplicity 2 and there are no other values of multiplicity 2. The values 
of multiplicity 2 correspond to the zeros of the derivative; hence, p'(z ) — 0 if and 
only if 

11 1 
Z = -UJi, -(W1+W2), 2^ ( modA )- 

Observe that the numbers ei,e 2 and e% are distinct. Suppose, for example, that 
e\ = e ?J . Then the function p(z) — e\ has zeros of multiplicity 2 at the points cui/2 
and 012 / 2 , i.e., inside the fundamental parallelogram there are at least 4 zeros of 
this function. This is impossible. 

The Weierstrass function not only gives an example of an elliptic function but 
enables one to describe the structure of all elliptic functions. 
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Theorem. Let f(z) be an arbitrary elliptic function and p(z) the Weierstrass 
function with the same periods. Then there exist rational functions R and Ri such 


that 

f = R(p) + RMp'. 


PROOF. It is possible to represent f(z) as the sum of the even function g(z ) — 
%(f( z ) + f(~ z )) and the odd function h(z) = \(f(z) - f(-z)). Since p'(z) is an 
odd function, h\{z) — h(z)/p'(z) is an even function and for the even functions g 
and hi we have 

/ 0 ) = g{z) + h 1 (z)p'(z). 

Therefore, it suffices to prove that any even elliptic function can be represented 
as a rational function of p. 

First, let us prove certain properties of zeros and poles of an even elliptic 
function. 

1°. Let / be an even function, and u its zero (resp. pole) of order m. Then 
—u is also a zero (resp. pole) of order m. Indeed, in the case of zeros it suffices to 
observe that for an even function / we have 

/<*>(-*) = (-i)V w (*)- 


In the case of poles we may consider j instead of /. 

2 °. If / is an even elliptic function and u = —u (mod A), then the order of a 
zero or pole of / at u is even. We will prove this for zeros (since for poles we may 
consider 4 instead of /). The condition u = —u (mod A) is equivalent to the fact 
that 

u = y> -g-’ T (mod A). 

Moreover, the periodicity of f implies that f'(u ) = f'(—u). But the derivative of 
an even function is odd; hence, f'(u) = 0. Therefore, if the function / has a zero 
at u, then this zero is of multiplicity at least two. For any of the cases 


uq u>i'+ U>2 ^2 , i * \ 

U ~Y' -2~’ T (mod A) 

the function F(z) = p(z) — p(u) has a zero of order 2 at u and if u = 0 (mod A), 
then the function F(z) = 1 /p{z) has such a property. Using F, we can construct 
an even elliptic function f\{z) = f(z)/F{z) for which the order of the zero at u is 
less by 2 than that of /. Hence, if fi{u) ^ 0, then the order of the zero of / at u 
is equal to 2 and if fi ( u) — 0, then we can apply the same arguments to /i instead 
of /, etc. 

By the above properties of zeros and poles of the even elliptic function / they 
can be divided into pairs of the form (x, —x). Select a representative from each 
such pair with a\ , ... , ak the representatives of the zeros and bi, ... , bk the 
representatives of the poles. Consider the elliptic function 


Q(z) = R(p(z)) - 


Y[{p(z) - p{aj )) 
II (P( z ) - P( b i )) ’ 


where we take only those a,i and bi that are distinct from the nodes of the lattice 
(since at the nodes the function p takes infinite values). If we disregard the nodes 
of the lattice, then the complete system of zeros and poles of Q is the same as that 
of /, since p{z ) = p(a ) if and only if z ~ ±a (mod A). But by Theorem 2.2.2 b) for 
an elliptic function the sum of orders of its zeros and poles inside the fundamental 
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parallelogram is equal to zero; hence, the order of a zero or a pole at a node of the 
lattice is uniquely determined by the orders of the other zeros and poles. Therefore, 
f(z)/Q(z) is an elliptic function without poles, i.e., a constant. As a result, we see 
that f(z) = cR(p(z)). □ 


Problems 

2.3.1. All the poles of an elliptic function / lie at the nodes of the period lattice. 
Prove that / = P(p) + Pi(p)p', where P and Pi are polynomials. 

Hint. Prom the decomposition / = R(p) + R\{p)p' we can derive that 

2 R(p(z)) = f(z) + f(-z ) and 2p'{z)R 1 (p(z)) = f(z ) - f{-z). 

Prove that the functions R and Pi do not take infinite values at finite points. 

§2.4. A differential equation for the Weierstrass function p{z) 

In the preceding section we proved that an even elliptic function can be ratio¬ 
nally expressed in terms of p(z) and the expression was explicitly described. This 
can be applied to the even function ( p'(z )) 2 . It has zeros of multiplicity 2 at 
^ ± < £ 2 ; wp. anc [ a p 0 i e 0 f multiplicity 6 at a node of the lattice. Hence, 

(4.1) {p\z)) 2 = c{p(z) - ei ){p(z) - e 2 )(p{z) - e 3 ), 

where ei = p(%±),e 2 — p(— f^) and e 3 = p(^f-). Since p(z) = z ~ 2 + ••• and 
p'(z ) = —2 z~ 3 + • ■ •, it follows that c = 4. 

There is also another way to obtain a differential equation for p(z). It not only 
gives us a new method to deduce this equation, but also provides another form of 
this equation. We will use the fact that if the coefficients of nonpositive powers of 
z in the Laurent expansions of the functions ( p'(z )) 2 and ap 3 (z)+ bp 2 (z)+ cp(z) + 
d coincide, then these functions are equal. Indeed, their difference is an elliptic 
function without poles and at 0 its value is equal to 0. Hence, their difference is a 
constant equal to 0. 

Since 

(rb) = ^(r^) = 1+2:r+33:2 + '"’ 

it follows that 



— —o + 3 G 4 Z 2 + 5 Gqz^ + • • • , 

where Gk = (for k odd this sum is equal to zero). Hence, 

p(z) = 2 P 2 -I-, p 2 (z) = z~ 4 + 6G4 H- 

p 3 (z) — z ® + 9G 4 z 2 + 15(?g H~ • • ■ , 

( p'(z )) 2 = 4z~ 6 - 24 G 4 Z~ 2 - 80G 6 + • • • 


? 
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(only the terms of the Laurent expansion of interest to us are written). Thus, 
ap 3 (z)Fbp 2 (z)-\-cp(z')-\-d — az ®-\-bz 4 + ( 9 a(j 4 +c),z 2 + ( 15 G^( 76 ^■■ 66 Gr 4 +d) + • • • . 
Therefore, ap 3 + bp 2 + cp + d — (p 1 ) 2 if 

d = 4, 6 = 0, 9 o(j 4 ~b c ~ —24(?4 and 15ciGg 4- 66 C ?4 T d = —80C?g, 


i.e., 

a — 4, 6 = 0, c = — 6 OG 4 and d = --140(76. 

Set g 2 — 6 OG 4 = 60 ]T)'u /~ 4 and <73 = 140Gg = 140 Then 

( 4 - 2 ) ( p'{z )) 2 = 4 p 3 (z) - ^2p(^) - 93- 

Comparing (4.1) with (4.2) we see that 

p2 53 

ei + e 2 + e 3 = 0 , eie 2 + e 2 e 3 + e 3 e\ = —— and e^es = ——. 

4 4 

It is easy to verify that 

92 ~ 27<?3 = 16(ei - e 2 ) 2 (e 2 - e 3 ) 2 (e 3 - ei) 2 . 

In the preceding section it was shown that the numbers ei, e 2 and e 3 are distinct. 
Therefore, g% — 27 g 2 7 ^ 0. A natural question arises: 

Given numbers 52 and .93 such that g\ 7 ^ 27<7 2 , is there a lattice for which 
<72 = 60 JO 'a ; -4 and <73 = 140 'tu -6 ? 

The answer to this question is an affirmative one; see, e.g., [B9], 


Problems 

2.4.1. Prove that p" = 6 p 2 - < 72/2 and p'" = 12 p'p. 

Hint. Differentiate (4.2). 

2.4.2. Prove that p^ 2n ~ 2 \z) and p( 2n+1 ' > (z)/p'(z) are polynomials in p(z) of 
degree n. 

2.4.3. Prove that e 2 + e 2 + e 2 = ef + e 3 + e 3 — and e\ + e 4 + e% — 

Hint. Use the fact that a + b + c divides both a 3 -f- 6 3 + c 3 — 3a6c and 
(a 2 + 6 2 + c 2 ) 2 - 2 (a 4 + 6 4 + e 4 ). 

§2.5. A parameterization of the cubic 
with the help of the Weierstrass function 

The differential equation for p enables us to clarify the nature of the addition 
of points on the cubic. To this end we have to use the fact that we left the following 
without proof earlier: 

For any numbers g 2 an( l .93 such that g 3 / 27g 2 there exists a lattice for which 
the Weierstrass function satisfies the equation 

(p') 2 = 4p 3 - g 2 p ~g 3 . 
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The cubic curve y 2 — Ax 3 — g 2 x - g 3 can be parameterized with the help of p 
setting x = p(z) and y = p'{z). Passing to homogeneous coordinates in CP 2 the 
map / : C/A —► CP 2 can be defined as 

. / (p(z),p'(z),l) for z^O, 

1 (z 3 p(z), z 3 p'(z), z 3 ) - ( 0 , 1 , 0 ) for 9 = 0 . 

Obviously, this map is analytic at all points distinct from the nodes of the lattice. 
Expressing it in the form 


2 i—> 


p(z) 1 1 \ 

?(*)’ ’p'(z)J 


we can verify that it is analytic in a neighborhood of the node of the lattice as well. 
The map / is a one-to-one map of the torus C/A to the cubic y 2 z = 4ic 3 — g 2 xz 2 — 
g 3 z 3 in CP 2 . 

Indeed, on the infinite line z = 0 there lies only the point (0,1,0) of this curve; 
the nodes of the lattice which correspond to one point on the torus are mapped into 
this point. For all other points we can consider an affine curve y 2 — Ax 3 — g 2 x — g 3 
and the map 2: 1 —» (p(z), p'{z))- 

The equation p(z) = c may have either one or two solutions. It has two 
solutions if p'(z ) 0. The solutions then are of the form ±z. The images of these 

two points under the map 2: 1—> (p(z),p'(z)) do not coincide, since the nonzero 
numbers p'(z) and p'(—z) — — p'(z) differ by a sign. 

The addition of complex numbers induces an addition of the points on a torus 
which, in turn, with the help of the map / induces an addition of the points on a 
cubic. It turns out that this is precisely the addition of points on a cubic defined 
in §1.1 if for the zero element we take the infinite point (0, 1, 0). 

Let the points Pi and P 2 on a cubic correspond to the points z\ and z 2 in C, 
i.e., Pi = ( p(zi ), p'(zj)). Let us draw the straight line y — ax + b through P\ and 
P 2 . Then p'(z'i) — ap(zi) + b, where i — 1, 2. : 

At the point 2: = 0 the elliptic function p'{z)—ap(z)—b has a pole of multiplicity 
3 and it has no other poles at the other points of the fundamental parallelogram. 
Therefore, the order of this function is equal to 3, i.e., it has precisely 3 zeros, 
namely, the already known zeros Zi and z 2 and a third zero z 3 . Since the sum 
of the poles and zeros is equal to zero, we have Z\ + z 2 + z 3 = 0 (mod A), i.e., 
z 3 = —zi ~ z 2 (mod A). Thus, the third point of intersection of the line P\P 2 with 
the cubic is the point 


H = (pteO, p'(z 3 )) = (p(-zi - z 2 ), p'(-z 1 - z 2 )) 
= (p(zi + Z 2 ), -p{zi + z 2 )). 


Therefore, the point P 3 = {p{z\ + z 2 ), p'{z\ + z 2 )) corresponding to the sum of 2 q 
and z 2 is symmetric to P 3 with respect to the cc-axis (see Figure 21 on the next 
page). In other words, P 3 is the point of intersection of the cubic with the straight 
line P 3 P, where E — (0,1,0) is the infinite point on the cubic. This is precisely 
what we wanted to establish. 

By extending the above arguments a bit further, one can show that there exists 
an algebraic addition theorem for p, i.e., p( 2 q + z 2 ) can be algebraically expressed 
in terms of p(^i) and p(z 2 ). Indeed, the line y = ax + b passing through the points 


:®fS 
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Figure 21 


Pi and P 2 intersects the cubic y 2 = 4a: 3 — g 2 x — <73 at the three points (xi,yi), where 
x\ = p(zi),X 2 = p(z 2 ) and X 3 = p{z\ -f- ^ 2 )- Therefore, the cubic equation 


{ax + b) 2 = 4a; 3 - g 2 x - g 3 


has the indicated roots X \, x 2 and x 3 . Expressing the coefficient of x 2 in terms of 
these roots we get 

a 2 

p{z 1 ) + p{z 2 ) + p(z 1 + z 2 ) = 


Since p'{z \) = ap(zi) + b and p'{z 2 ) = ap{z 2 ) + b, it follows that a — 
Therefore, 


p'(zi)~p'(z 2 ) 

p(Zl)~P(Z2) 


p(zi + *s) - - Pfe) + \ ( ' 

Thus, + 22 ) can be rationally expressed in terms of the p(zj) and p / (^), 
z = 1,2. It remains to recall that p'(zi) can be algebraically expressed in terms of 
pfa), namely: 

p'(Zi) = V / 4^5 3 (2 i ) - g 2 p{Zi ) - .93- 


* * * 

With the help of the Weierstrass function one can also parameterize the curve 

y 2 - G 4 (x), 

where G 4 (x) = a Q x 4 + aix 3 + a 2 x 2 + a 3 x + a 4 is a fourth degree polynomial without 
multiple roots. To this end let us make the change of variables x = x± l + a, 
V = yi^i 2 . We get 

y\x ~{ 4 = b 4 x ± 4 + b 3 Xi 3 + b 2 x ^ 2 + bix ^ 1 + G 4 (a). 

If a is a root of G 4 , then y\ — 61 a ; 3 + b 2 x 2 + b 3 x 4 + 64 . This cubic can be pa¬ 
rameterized with the help of the Weierstrass function. Notice that the change of 
variables 

x = a:^ 1 + a, y = y\X^ n 

enables us to pass in a similar way from the curve y 2 = G 2 n {x) to the curve 
y 2 = G 2 n -i{x). 
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§2.6. The elliptic integrals 

The Weierstrass function p(z) satisfies, as we have seen, the differential equa- 

2 

= 4 p 3 - g 2 p~ 93 - 


dp 


tion 

(dp) 


\dz) 

Therefore, 

dz — 

i.e., 



J 


V 4 P 3 - 92 p- 9 z 
du 


V 4 u 3 - g 2 u - g 3 ’ 
where u — p(z). Thus, z = p -1 (/u), i.e., the inversion of the integral 


du 


V 4u3 - g 2 u- g 3 


gives rise to the Weierstrass function. 

An elliptic integral is an integral of the form 


R(x , y/G(x))dx, 


where G(x ) is a polynomial of degree 3 or 4 without multiple roots and R(x,y) 
is a rational function of two variables. At first, such integrals appeared in the 
calculations of arc lengths of various curves, for instance, ellipses. Only later it 
was noticed that for certain elliptic integrals the inverse functions possess more 
interesting properties, primarily, double periodicity. 

Elliptic integrals can be reduced to certain simpler integrals. Let us prelimi¬ 
narily consider integrals of a form less involved than that of elliptic integrals. First 
of all, let us prove that if R(x) is a rational function, then f R(x)dx is the sum of 
a rational function and a certain number of summands of the form cy In (sc — a,;). It 
suffices to prove that a rational function R(x) can be represented in the form 


A (x) + J2 

i,k 


Ci,k 

(x - ai) k ’ 


where A{x) is a polynomial. Let R(x) — P(x)/Q(x), where P and Q are polyno¬ 
mials. Dividing P by Q with a remainder we can pass to the fraction S/Q, where 
deg S < deg Q. Let Q — Q1Q2, where Q\ and Q 2 are relatively prime polynomi¬ 
als. Then there exist polynomials a and b such that a(x)Qi(x) + 6 (t)Q 2 (t) = 1 . 
Therefore, 

S _ aiSQi + a 2 SQ2 _ aiS a 2 S 

Q1Q2 Q1Q2 Q2 Qi 

In the fractions obtained one should also divide the numerator by the denominator 
with a remainder. After several such operations we arrive at the sum of a polynomial 
A(x) and several fractions of the form p(x)(x— a) -n , where degp(a;) < n. The proof 
is completed by expressing the polynomial p{x) in the form 


p(x) = bi(x - a) n 1 +b 2 (x-a) n 2 -j - hb n . 
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Leibniz was the first to study integration of rational functions. He only consid¬ 
ered factorization of polynomials into factors with real coefficients and, therefore, 
he faced the question: whether or not it is true that any real polynomial can be fac¬ 
tored into factors of degree 1 and 2 with real coefficients. In 1702 Leibniz published 
a paper in which he claimed that it is impossible to factor the polynomial x 4 + a 4 
in the required fashion since 

x 4 + a 4 — ( x 2 + a 2 \/— l){x 2 - a 2 V^-i) 

— (x + a\/V—T)0c - o-V V-l )( x + 1)(® _ av / -v / -I) 

and the product of any two of these factors cannot be, as he believed, a quadratic 
with real coefficients. Only 17 years after that Nicholas Bernoulli (1687-1759) 
indicated that 


x 4 + a 4 — (x 2 + a 2 ) 2 — 2a 2 x 2 — (x 2 + \[2 ax + a 2 )(x 2 — V2 ax + a 2 ). 

In their correspondence Leibniz and Jacob Bernoulli also discussed integrals of 
irrational expressions that appear in the study of various physical and mathematical 
problems. Many of these integrals are elliptic ones. 

Let us now pass from rational functions to the simplest irrationalities. To 
calculate the integral 

J R(x , a/ G(x))dx, 

where G(x) — ax + b is a linear function, we first fhake the change of variables 
u — ax + b. As a result we get an integral of the form 

f Rl ( u, \fu)du , 

where Ri is again a rational function. Now, set t = ffixi. Then du — d(t 2 ) = 2 tdt 
and, therefore, 



Ri(t 2 ,t) 2 tdt = J R 2 {t)dt, 


where Rz is a rational function. 

Now, let G(x ) = ax 2 + bx f c. As has been said in the preceding section, with 
the help of the change of variables x = xf 1 + a, y — yixf 1 we can pass from the 
curve y 2 = G(x) to the curve y\ — G i(»i), where Gy is a linear function. Let us 
apply this change of variables in order to calculate the integral / R(x, y)dx, where 
y 2 — G{x). Let x — xf 1 +a and y — yixf 1 , where C7(o!) — 0. Then dx = —xf 2 dxy 
and 


J R(x, y)dx — 


R(xy 1 + ct, y\x x x )x x 2 dxy — 


Ri(xi,yi)dxy, 


where y 2 — Ax \ + B. 

Thus, the integrals of the form f R(x,y)dx, where R is a rational function and 
y = \JG(x), can be expressed in terms of elementary functions if deg G <2. In the 
case when deg G = 3 there may appear functions which are inverse to elliptic ones. 
The integral f R(x, y)dx , where y = y/Gffx), reduces to the integral f Q(x, y)dx, 
where y — -\/4x :i — gzx — g?,. Indeed, using the change of variables x = xf 1 + cc, 
y = y \X X we can pass from the fourth degree polynomial G 4 to a third degree 
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polynomial and from an arbitrary third degree polynomial we can pass with the 
help of a linear change to a polynomial of the form 4 x 3 — g 2 x — <73. 

We could have confined ourselves to the calculation of integrals of the form 
f R(x,y)dx, where y 2 = 4 .x 3 — g 2 x — §3, but in many cases certain other forms of 
elliptic integrals are convenient. Therefore, we will first calculate elliptic integrals 
in the general form and later on we will study certain special forms. 

Let I — f R(x,y)dx , where R is a rational function, and 

y 2 = aox 4 + 4 aiX 3 + 6a 2 x 2 + 4a,3X + 04, 
where at least one of the coefficients ao and a\ is nonzero. 


Theorem (Legendre). The elliptic integral I can be represented as a linear 
combination of a rational function in x and y, the integral of a rational function of 
x, and of the integrals 



and 


/ 


dx 

(x - c)y ' 


PROOF. Since y 2 can be polynomially expressed in terms of x. we can assume 
that a rational function R does not contain y k for k > 2 . Moreover, 

a + by _ (a + by)(c - dy)y _ A B 
c + dy (c + dy)(c-dy)y y 

where A and B are rational functions of x. Therefore, the calculation of the integral 
f R(x,y)dx reduces to the calculation of the integrals f B(x)dx and f A 'B dx , The 
rational function A(x) can be represented in the form 

Hence, it remains to consider the integrals of the type 


Jr, 


-I 


x n dx 

y 


(n > 0 ) and H r 


dx 


[x — c) m y 


(m > 1). 


Since 


4-(x m y) = mx Tn ~ l y + x md ^ = -- 
dx dx y 


mx 


m-1,,2 , >.#! 
y + 2 dx 


= (m + 2)a 0 


x 


m+3 


y 


T 2 ( 2 m T 3 )ai 


x 


m +2 


y 


+ 6(m + l)a2 


x 


,m +1 X m ~^ 

-h 2(2 m + 1)03-h ma4 . - 

y y y 


it follows that 


x m y ~ (m + 2)a 0 J m +3 + 2(2m + 3 )oi J m+2 

+ 6 (m + l)a2 Jm+i T 2(2 m + + ma^Jm—i. 

Using these identities for m — 0 , 1 , 2 ,..., we can consecutively express J3 in terms 
of J 0 , Ji, J2 (and a rational function of x and y), then J4 in terms of Jo, Ji, J2, 
etc. (In the case when ao ~ 0 we will express J 2 in terms of Jo and Ji ; next, J3 in 
terms of Jq and Ji, etc.) 
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To compute the integrals H m — f 


dx 


( x—c) m y 


, let us write the polynomial G(x) in 


the form 

G(x) = b 0 (x — c ) 4 + 4 bi(x — c ) 3 + 6 b 2 (x — c ) 2 + 4 b 3 (x - c) + 64 , 

where 60 = «o- As in the preceding case, we get the identity 

d ( T — r ) m +3 ( r _ r \ m +2 

-j-[(x ~ c) m y\ = (m + 2)6 0 --— + 2(2m + 3)6 X -—— 

ax y y 

. (x — c ) m+1 , (x — c) m (x — c) m ~ l 

-f 6(m -f 1)62--H 2(2 m - j - 1)63--h 77164- 

y y 

Integrating these identities for m = —1, —2, —3, ... we get 

(;x — c) 2 


y 


y 

X — c 

y 

(x — c ) 2 

y 

(x — c ) 3 


/ 


y 


-dx + 26 x 


x — c 

y 


dx 


bo Jo 


— 26i Jo 
- 6 M! 


-263^1 ~6 4 tf 2 , 
66 2 U x -663^2 -264^3, 

12b 2 H 2 — IO& 3 U 3 — 364 if 4 , 


These identities enable us to express Jf 2 , # 3 , Ha , • • • In terms of Jo, J x , J 2 , Hi 
and rational functions of x and y. □ 


As we have already mentioned, any elliptic integral can be reduced to the 
integral f R(x, y)dx, where 


y 2 = 4a ; 3 - g 2 x - g 3 . 


This form of elliptic integrals is called the Weierstrass form. Since in this case 
ao = 0, it follows that J 2 can be expressed in terms of Jq and J x ; therefore, there 
are three types of integrals from which the rest may be calculated: 


/ 


dx 


yj4x 3 


92X - go 


■/ 


xdx 


y/4 


and 


dx 


X-i - g 2 x _ g 3 


(x - c)y/4x 3 - g 2 x - go 


Another widely used form of elliptic integrals is the Legendre form. For it the 
equation 

y 2 — (1 — a: 2 )(l — k 2 x 2 ) 

is used. One can pass from the Weierstrass form to the Legendre form as follows. 
Using linear change of variables x = ax\ + 6 we pass from 4a ; 3 — g 2 x — go to 
a:i(a;i — 1)(aq — k 2 ). Next, let us make the change of variables £ 2 = o:^ 1 , rj 2 = 
y 2 xf 3 . We get rj 2 = (1 — £ 2 )(1 — k 2 £ 2 ). 

For the Legendre form, all four types of integrals appear: 

/ dx f xdx f x 2 dx f dx 

y/G{x)' J y/G(x y J y/G(x) ’ J (x - c)y/G(x) ’ 
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where G(x) = (1 — rr 2 )(1 — k 2 x 2 ). But, since 


/ 


xdx 


y/(l — X 2 )(l — k 2 x 2 ) 2 J y^(l — u)( 1 — fc 2 «) ’ 


/ 




where u — x 2 , this integral can be expressed in terms of elementary functions. 

To simplify somewhat the form of the integrals in the Legendre form, make the 
change of variables x = sin p. Then 


dx — cos pdp, — x 2 = cos 9 ?, yj\~ k 2 x 2 = y 1 — k 2 sin 2 p. 

Therefore, the above mentioned (nonelementary) integrals take the form 


dip 

\/l — k 2 sin 2 ip 


/ sin 2 ipdip f d<p 

\/l — k 2 sin 2 ip J (sin ip — c) y/l — k 2 sin 2 ip 


These integrals are called elliptic integrals of the first, second , and third kind, 
respectively. 

sin 2 ipdip 


Observe that instead of the integral J 
J y/l — k 2 sin 2 ipdip because 


\/l — k 2 si 


sin 2 ip 


we may take the integral 


k* 


f sin 2 p dp 

f dp 

_ / 

J \/l — k 2 sin 2 p J 

y/l — k 2 sin 2 p 

J 


1 — k 2 sin 2 p) dip. 


/ 


sin 2 ipdip 


as 


Remark. We abuse the language a bit when we refer to , ___ _ 

yl — k 2 sin 2 <p 

the elliptic integral of the second kind because in the literature this term is applied 
to the integral f yT — k 2 sin 2 ipdip. 

For the elliptic integrals of the first and second kind Legendre’s notations are 
used: 


r‘P 

F{V) = / 
Jo 


dp 

\f\ — k 2 sin 2 


and 




r‘P 

E(p) = 

Jo 


1 — k 2 sin 2 pdp. 


Problems 

2.6.1. Prove that a change of variables reduces the integral f (1 + x 6 )~~ 1//3 dx to 
an elliptic integral. 

Hint. Set x ~ 3 + x 3 = 2 1 ~ 3/2 . 

2.6.2. Prove that a change of variables reduces the integral /(I — x 3 )~ 2 ^ :i dx to 
an elliptic integral. 

Hint. Set t( 1 — x) = (1 — a: 3 ) 1 / 3 . 
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§2.7. Addition theorems for the elliptic integrals F(p) and E(<p) 

Set 

F i<P) = an d E(p) = A(p)dp, 

where A(</>) = \/l — k 2 sin 2 p. If F(p) + F(ip) — F(p ), then sin// can be alge¬ 
braically expressed in terms of siny? and sin-0. To prove this, consider the differ¬ 
ential equation 


(*) — - dip + , _ 

yl — k 2 sin 2 p i/l — k 2 sin 2 ip 

Its integral is F(p) + F(ip) — F(p) = 0, where p is a constant. Taking into account 
that F is an odd function, the integral can be expressed in the form F(<p) + F(ip) -f 
F(~p) — 0. Let us show that the integral of the differential equation (*) satisfies 
the relation 

(7.1) 


dip 


0 . 


cos p cos ip — sin ip sin ip ■J 1 — k 2 sin 2 p = cos p 
which can be rewritten after squaring in a more symmetric form: 

(7.2) cos 2 ip + cos 2 ip + cos 2 p — 2 cos <p cos ip cos p, + k 2 sin 2 tp sin 2 ip sin 2 p —1. 
The term “symmetric” means that not only is (7.1) satisfied but also the relations 

(7.3) 


cos p cos ip + sin p sin <p\f 1 — k 2 sin 2 ip = cos ip 


(7.4) cos p cos ip + sin p sin ip y 1 — k 2 sin 2 ip — cos p, 

since the arguments p, ip and —p enter (7.2) symmetrically. 

Divide both sides of (7.1) by sin p sin ip and differentiate the obtained expres¬ 
sion. The result can be expressed in the form 

( cos ip — cos p cos p \ ,, ( cos p — cos p cos ip \ 

dp - H - z_ + dip( - r ^ = 0 . 

\ sm p J \ sin ip J 

Making use of formulas (7.3) and (7.4) we get 

dp i dip 


+ 


0 . 


\/l — k 2 sin 2 p y/\ — k 2 sin 2 ip 

Thus, (7.1) is indeed an integral of the differential equation (*). But it cannot have 
two independent integrals and, therefore, the equality F(p) + F(ip) = F(p) implies 
that 

cos p cos ip — sin psinip\J 1 — k 2 sin 2 p = cos p. 

This provides us with an implicit expression for cos p. It is not difficult to get an 
explicit expression also. Let x = cos//; then sin 2 p — 1 - ,x 2 . Relation (7.2) can be 
considered as a quadratic expression in x. By solving it for x we get 

(7.5) cos p - cos< 7 ;cos ^ A(0) 

1 — k 2 sin 2 p sin 2 ip 

(The sign is chosen so that the formulas (7.5) and (7.1) are compatible for small p 
and ip.) 
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By direct algebraic transformations we can derive from (7.5) the following ex¬ 
pressions for sin /i and A(/r): 

sin p cos ipA(ip) + sin ip cos pA(p) 


(7.6) 


(7.7) 


1 >2 • 2 • 2 / 

1 — sin p sin ^ 

A(p)A(ip) — k 2 sin p sin ip cos p cos ip 


A(/i) = 


1 — k 2 sin 2 ip sin 2 ip 
tan<^A(<£>) + iaxnpA(ip) 


Dividing (7.6) by (7.5) we get 

^ ^ 1 M l — ta,npta,nipA(p)A(ip) 

The latter formula can be interpreted as follows. Let the angles ip' and ip 1 be such 
that t&np' = taxnpA(p) and tan-0' = tanipA(ip). Then p = p' + ip'. 

In applications the case when p — is quite important. In this case cos p — Q 
and sin p = 1. From (7.3) and (7.4) we get sin</? = cosip/A(ip) and sin ip = 
cosy/A(<p), and from (7.1) we get cosipcosip = bsinpsmip, i.e., bt&nptaxxip = 1, 
where b — s/l — k 2 . It follows from (7.5) that 

cos ip cos ip — A(p)A(ip) sin ip sin ip 

and, therefore, A(tp)A(ip) = b. Hence, 

b sin ip _ __ _ ( b sin <p 


cos <p = sinipA(p) = 


A {ip) 


and cos ip 


A(p) ' 


Formulas (7.5) and (7.6) resemble, to some extent, the formulas for the cosine 
and sine of the sum of two angles. With their help we can obtain expressions similar 
to expressions of cos nip and sin rup in terms of cos ip and sin</?. Let F{ip n ) = nF{<p). 
Then 

2 sin ip cose? A (ip) 1 — 2 sin 2 ip + k 2 sin 4 ip - 

Sm</?2 = —- . ■ , . , COS ip 2 = - 


1 


k 2 sin 4 p 
,2 .. , ,„2 , 


A iF2) 


1 — 2 Ar sin p + k sin ip 


tan ip 2 = 


1 — k 2 sin 4 p> 
2tan<^A((^) 

1 — (tan</?A(<^)) 2 


1 — k 2 sin 4 ip 

To find ip from a given p 2 , we can use the fact that tan(^) = tanpA(p) and may 
also solve the equation 

1 — 2ic 2 + k 2 x 4 

cos <p 2 = x _ 1 2^4 > 

where x = sin p. This equation corresponds to the division of F(p 2 ) in halves. 

To divide F(ip) into three equal parts, we must solve the equation 

3x — 4(1 + k 2 )x 3 + 6k 2 x 5 — k 4 x 9 
S ^ n ^ 1 — 6k 2 x 4 + 4fc 2 (l + k 2 )x & — 3 k 4 x 8 ’ 

where x — sin p. For ip = \ we get the equation 

(1 + x)(l — 2x + 2k 2 x 3 ' - k 2 x 4 ) 2 = 0, 

i.e., the division of F(ir/2) into three equal parts reduces to the solution of the 
equation 

1 — 2 sin p + 2 k 2 sin 3 p + k 2 sin 4 p — 0. 
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For the elliptic integral of the second kind, E(p), there is only an addition 
theorem with an extra algebraic term. It is directly connected with the addition 
theorem for the elliptic integral of the first kind, F(p). 

Theorem. If F(p) + F(ip) - F(y) = 0, then 

E(ip) + E(ip) — E(y) = k 2 sin (/sin ip sin y. 


Proof. Let E(p)+E(ip) — E(y) = P(<p, if, y). Let us differentiate this equality 
for a constant value of y. 

As a result we get 

A(p)dp + A{ip)dip — dP. 

But by (7.3) and (7.4) 


. . . cos p — cos ib cos y , . . ,. cos ip — cos p cos y 

A(p) = - and A W = ' 


Hence, 


dP 


sin ip sin y 

' cos (p — cos ip cos y ' 
sin ip sin y 


dip + 


sm (p sm y 

' cos ip — cos p cos y \ 
sin ipsmy ) 


dip 


d(sin 2 p + sin 2 ip + 2 cos p cos ip cos y) 

2 sin p sin ip sin y 

By (7.2) we have 

1 — sin 2 p + 1 — sin 2 ip — 2 cos p cos ip cos y = 1 — cos 2 y — k 2 sin 2 p sin 2 ip sin 2 y. 
Hence, 

d(fcsin(/?sin^sin ^) 2 . 2j/ . . . . . 

dP — —- ■■■ . : ■ . . = k d{ sm p sm ip sm y). 

2 sm p sm ip' sm y 

Since both P and sin p sin ip sin y vanish at p = 0, we get P = k 2 sin^sini/jsinyU. □ 


This addition theorem enables us to get the following expressions for nE(p) — 
E(p n ), where F{p n ) - nF(p ): 

2E{p) — E{pf) = k 2 sin</ 2 sin(^sin<^ 2 , 

3 E(p) - E{p 3 ) = (2 E(p) - E{p 2 )) + (E{p 2 ) + E(p) - E(p 3 )) 

= k 2 sint/ 2 (sin<^sin (^2 + sin p 2 sin <£> 3 ), 

etc. In Chapter 3 we will apply the formulas obtained in this section to solve various 
problems on arc lengths of an ellipse. 


§2.8. The elliptic Jacobi functions 

The function considered in the preceding section 


F(f) = f 

Jo 


dx 


y/(l — X 2 )(l — k 2 x 2 ) 


rip 

Jo 


dp 


V 1 — k 2 sin 2 p 

where x = sin p, is in many ways analogous to the function 


arcsm x 
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where x — sintp. For example, if F((p) + F(-0) = F(/j,), then sinp, is expressed in 
terms of sin cp and sin '0 by the formula 

, . . sinipcos^A^) + sin0cos-0A(<p) 

(8.1) sin/i - . 2 . 2 / ’ 

1 — k z sin <p sm 0 

and if arcsin x + arcsin y — arcsine, where x — .sincp , y = sin-0, and z — sinp , 
then 


(8.2) sin/i = sin cos 0 + sin0costp. 

This analogy is not accidental: for k = 0 the function F(cp) becomes the function 
arcsin x — <p and formula (8.1) turns into (8.2). 

For many reasons it is more convenient to consider not the function tp = arcsin x 
but the inverse function x — sin <p. The function inverse to the function F(<p) is 
also in many respects more convenient than the function F((p) itself. Let us replace 
Legendre’s notation F(<p) with the Jacobi notation: set u(<p) — F(<p). The function 
<p(u) inverse to u(tp) is called the amplitude of u and is denoted by <p = am u. In 
the preceding section we obtained formulas (7.5)-(7.7) for the functions sin p, cos <p 
and A(<p) — y/l — k 2 sin 2 <p. Introducing the functions 

snu = sin am u, cn u — cos am u and dn u — A(am u) 


we may express formulas (7.5)-(7.7) in the following way: 


(8.3) 

cn(u + v) = 

(8.4) 

sn(u + v) = 

(8.5) 

dn(u + v) = 


cn u cn v — sn u sn v dn u dn v 
1 — k 2 sn 2 u sn 2 v 
sn u cn v dn v + sn v cn u dn u 
1 — k 2 sn 2 u sn 2 v 
dn u dn v — k 2 sn u sn v cn u cn v 
1 — k 2 sn 2 u sn 2 v 


The functions snu, cn u and dn u are usually called elliptic Jacobi functions al¬ 
though many properties of these functions had been established, to some extent, 
by Legendre and Abel before Jacobi. 

One of the most important properties of the functions snu, cnu and dnu is 
their double periodicity. The existence of one period for these functions is quite 
obvious. Indeed, the functions sin <p and cos <p have period 2 tt — 4(|) and the 
function sin 2 <p has period 7r = 2(|). Therefore, the functions snu and cnu have 
period 4 K, where 




dip 


yfT-k 2 


sm <p 


and the function dnu = \/l — k 2 sn 2 u has period 2 K. 

With the help of the addition formulas (8.3)-(8.5) we can figure out how the 
functions sn u, cn u and dn u behave if the argument is increased by a quarter of the 
period, K , and by a half period, 2 K. Substituting 


snK = 1, cnK — 0 and dnA — yj\ — k 2 


into (8.3)~(8.5) we get 


, , cnu , _ _. 

sn(u + K) = - -, cn(u + K) 

dnu 


\/l — k 2 snu . . y/l — k 2 

-, dn(u + K) = -- 


dnu 


dnu 
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Since sn2 K = 0, cn2iF — ~1 and dn2 K = 1, it follows that 

sn(n + 2 K) — -sn«, cn (u + 2 K) = —cnu, dn(a + 2 K) = dn u. 

It is somewhat more difficult to figure out what the other period of the elliptic 
Jacobi functions is. Let us first recall that the integral 


dip 


was obtained from the integral 


-\/l — k 2 sin 2 


dx 

y/(l — X 2 )(l “ k 2 x 2 ) 


with the help of the change of variables x = sin ip. It will be more convenient now 
to work with the original integral. On the complex plane C, the function 


u(x) = [ 
Jo 


dx 


i/(l — x 2 ){l — k 2 x 2 ) 


is not defined in general because the value of u(x) depends on the contour of 
integration. The values of the function u(x) at the same point can differ by numbers 
of the form 

L= [ 

JC y/{l ~ X 2 )(l - k 2 X 2 ) ’ 

where the integral is taken along a closed contour C. Here any such number L is a 
period of the inverse function x(u). The integrand has singular points ±1, ±k~ J . 

Let us see how the choice of the path around th ese points affects the values of 
the functions sn u — x, cnu = y/ 1 — and dn u — y/l — k 2 x 2 . 

Let 

(•1 px 


dx 


= K and 


/' 


dx 


\J (1 — x 2 )(l — k 2 x 2 ) 


= a. 


i o \/(l — cc 2 )(1 — k 2 x 2 ) 

The path shown in Figure 22 shows that the values of the integral at the point 
X are equal to a and K + (K - a) = 2 K - a. Indeed, on the last part of the 
path both the sign of the function y/l - x 2 and the direction of the segment of 
integration change; as the result of two such changes the sign of the integral does 
not change. Therefore, sna = sn(2iL - a). Furthermore, the_sign of the function 
y/l — x 2 changes under this passage and the sign of y/\ — k 2 x 2 does not change. 
Hence, cna = - cn(2iL — ck) and dna = dn(2.fr — a). Replacing a with —a we get 

sn(o; + 2 K) = sn(— a) = -sna, 
cn(o; + 2 K) = - cn(—a) = - cna, 
dn(a + 2 K) = dn(—a) = dna. 

We have already obtained these formulas by another method. 


0 




i 

=0 


Figure 22 
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1 


Figure 23 


Now, let us consider the passage along the curve shown in Figure 23. Let 



dx 

y/l — x 2 \/l — k 2 x 2 



(The number K' is real since the number \/l — x 2 is purely imaginary for x E 
(1,A: _1 ).) Therefore, the values of the integral at point X are equal to a and 
K + %K' + iK' — (K — a). Here only the sign of the last summand needs elucidation. 

Observe that there occurred three changes of sign: the direction of the inte- 
g ration con tour had changed, and so did the signs of both functions \/l — x 2 and 
y/l — k 2 x 2 . As a result, we get 


sna = sn(o: + 2 iK'), 
cn a = — cn(a + 2 iK'), 
dn a — — dn(a + 2 iK'). 


Therefore, the functions snu, cn u, and dn u have periods 2 iK', 4 iK', and 4 iK', 
respectively. Moreover, the function cn u has a period 2 K + 2 iK' . Indeed, 

cn(ct + 2 iK’ + 2 K) — — cn(a + 2 iK') = cna. 

Thus, the function snu has periods 4 K and 2 iK; the function cnu has periods 4K 
and 2 K + 2iK'\ and the function dn u has periods 2 K and 4 iK'. 


Problems r 

2.8.1. Prove that K' — f* , . dt .^. , where k' 2 + k 2 = 1. 

Hint. Make the change k't = y/l — k 2 x 2 , where k' is chosen so that ky/x 2 — 1 
— k'\/l — t 2 . 

§2.9. The Weierstrass theorem on functions possessing 
an algebraic addition theorem 

We will say that a meromorphic function ip(z) possesses an algebraic addition 
theorem if there exists a nonzero polynomial F in three variables such that 

F(ip{zx + Z 2 ), <p(zi), y{z2)) - 0; 

this identity means that (p(zi + z 2 ) can be algebraically expressed in terms of ip{z\) 
and <p{z2). 

For example, the function sn z possesses an algebraic addition theorem. Indeed, 
if a — sn( 2 i + z 2 ), b = sn^i and c = sn^, then 

bVT^yfl — k 2 c 2 + cyf 1 — b 2 V 1 — k 2 b 2 


a — 


1 — k 2 b 2 c 2 
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Squaring twice we can get rid of the radicals and get a polynomial relation 


F(a, b, c) — 0. 

The Weierstrass function p(z) also possesses an algebraic addition theorem. 
Indeed, if a = p{z\ + z 2 ), b — p{z\) and c — p(z 2 ), then 


7 , 1 ( V 4b ~ 92 b - 93 - \/4c 3 - g 2 c- gs 

a - -b-c + - - -7-—- 

4 \ b-c 


By simplifying this formula we get a relation of the form F(a , 6, c) = 0, where F is 
a polynomial. 

As we saw in §2.3, any elliptic function / can be represented in the form 

/ = R{p) +Ri(p)p', 

where R and R\ are rational functions (see §2.3). This representation enables us to 
get an algebraic addition theorem for an arbitrary elliptic function if we take into 
account that 

(p') 2 = 4p 3 - g 2 p - g 3 - 

Indeed, if i?i ^ 0, then p' — • Hence, 


f-R(p ) ) 2 
Ri(p) ) 


- 4p3 — g 2 p — g 3 - 


This relation can be rewritten in the form P(/, p) = 0, where P is a polynomial 
(its degree with respect to / is equal to 2). If Pi = 0, the relation / = R(p) 
can also be expressed in the above form. Let A — f{zi+z 2 ), B — f(zi), and 
C — f(z 2 )- Prom the relations F(a, b, c) = 0 and P(A, a) — 0 we can get a relation 
F\ (A, b, c) = 0 by calculating the resultant of the polynomials f(a) = F(a , 6, c) and 
g(a) = P(A, a). 

Next, from the relations Fi(A,b,c) — 0 and P(B,b) = 0 we get F-iiA, B, c) = 
0 and from the relations F 2 (A, B,c) — 0 and P(C,c ) = 0 we get a relation 
G(A, B, C) — 0, as required. 

Not only elliptic functions possess algebraic addition theorems. For instance, 
the exponential function e z possesses the algebraic addition theorem, since e Zl+Z2 — 
e Zl e Z2 . Moreover, if u = R{f), where R is a rational function, then P(u,f) = 0, 
where P is a polynomial (linear in u). Hence, from the relation F(a, b, c ) = 0, where 


F(a,b,c) = 


a — (b + c) if f = z, 
a-be if / = e Xz , 


we may get, as above, a relation G(A,P, C) — 0, where A — R(a), B — R(b ), and 
C — R(c). Therefore, any rational function and also any rational function in e Xz 
possesses an algebraic addition theorem. 

It turns out that the above examples exhaust all meromorphic functions pos¬ 
sessing an algebraic addition theorem. 


Theorem (Weierstrass). Any meromorphic function ip(z) possessing an al¬ 
gebraic addition theorem is either an elliptic function or is of the form R(z) or 
R(e Xz ), where R is a rational function. 
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Proof (W. S. Osgood). In the finite domain the meromorphic function has no 
singular points other than poles. If the limits 

lim <p(z) or lim l 

z~~*oo Z ~~*OO tp{z) 

exist, then the function cp(z) is rational. Indeed, let us subtract from <p the sum of 
its principal parts at all the poles (if the point oo is a pole, then the principal part 
of the function at this point is of the form a r z r + a r+1 z z+1 + • ■ •, where r > 0). 
As a result, we get a function / without singular points; the point oo is also a 
nonsingular one. Therefore, / is a constant and the initial function ip is rational. 

In what follows we will assume that the function ip is not rational, i.e., the point 
oo is an essential singularity. To prove the Weierstrass theorem, we will need the 
following theorem on the behavior of a function in a neighborhood of an essentially 
singular point. 

Picard’s big theorem. Any analytic function ip(z) assumes in an arbitrary 
neighborhood of an essentially singular point any finite value except, perhaps, one 
value. 

Proof of this theorem can be found, e.g., in [B16], 

Let F(<p(zi + Z 2 ), (p(zi), (p(z 2 )) = 0, where F is a polynomial whose degree with 
respect to the first argument is equal to n. We have to prove that ip is a periodic 
function and if it is not doubly periodic, then (p(z) = R(e Xz ). Picard’s big theorem 
implies that in a neighborhood of an essentially singular point the function ip takes 
a certain value c infinitely many times. 

Let at, ... , a n+ i be points at which <p takes value c. The singular points of ip , 
together with the points z for which the points z + a* are singular, form a set of 
measure zero. Therefore, there exists a nonsingular point Zq of the function ip for 
which all points ai + zq are also nonsingular ones. Then the points 2 and a t + 2 for 
values £ sufficiently close to zq are also nonsingular. For such points 2 consider the 
equation 

(9.1) F(x,ip{z),c) = 0. 

It has n + 1 roots Xi = ip(z + a*), because 

F(ip(z + o*), ip(z), c) = F(ip(z + a*), <p(z), <p(a;)) = 0. 

Equation (9.1) is a nonzero polynomial in x of degree n and, therefore, it has 
at most n distinct roots. It follows that <p{z + a p ) = ip(z + a q ) for certain distinct 
p and q. Such a relation is satisfied for any point z from a neighborhood of zq, but 
the pairs (p, q) can differ. 

Nevertheless, there are only finitely many pairs (p, q) and, therefore, some 
relation ip(z + a p ) — <p(z + a q ) holds for an infinite set of points 2. These points 
have a limit point z\ and the function <p is regular at z\. By the uniqueness theorem 
we see that the functions <p(z + a p ) and ip(z + a q ) coincide, i.e., a p — a q is a period 
of ip. 

We have proved that <p is a periodic function; for definiteness, we may assume 
that the minimal period of <p is equal to 2tx. Let us assume that ip has no other 
periods and then show that <p(z) = R(w), where w — e lz and R is a rational 
function. The map z 1—> w = e iz sends the strip 0 < Re 2: < 2?r into the plane with 
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the cut from 0 to Too. The function ip(w) = ip(z) is meromorphic on the plane 
with points 0 and oo punctured. If these points are not essentially singular ones, 
then the function ip is rational. 

Suppose that at least one of these poles is essentially singular. Then by Picard’s 
big theorem there exist points bi, ... , b n+ 1 for which ip(bi) — c. The preimages 
j3i, ... , f3 n +i of these points with respect to the map z i—■> w are distinct and 
belong to the strip 0 < Re^ < 27r. The equation F(x,(p(z),c) = 0 has roots Xi — 
ip((3i + z). Repeating the same arguments as above, we see that the function ip has a 
period (5 P - @ q , where 0 < Re/3 P , Re f3 q < 2n. Therefore, either <p has another 
period in addition to the purely real period 2 it or it has a real period smaller than 
27 t. The latter case is impossible, since the period 2 n is the smallest one by the 
assumption. □ 



CHAPTER 3 


Arcs of Curves and Elliptic Integrals 


For a circle it is easy to construct an arc whose length is equal to the sum of 
lengths of two other arcs of this circle. Generally speaking, this is related to the 
fact that sin(<£ + ip) is expressed in terms of sin ip and sin ip by the formula 


sin(</? T ip) — sixupy 1 — sin 2 ip + sin^y 1 — sin 2 <p. 

For the elliptic integrals F(ip) and E(<p) there also exist addition theorems, al¬ 
though more cumbersome, as we have seen. This means that for curves whose arc 
lengths are expressed in terms of F(p) and E(ip) the operation of addition of arc 
lengths is also possible, although it is more involved; moreover, for E{p) it is not, 
strictly speaking, the straightforward addition, but an addition with a certain extra 
algebraic term. 

In this chapter we will study curves whose arc lengths can be expressed in terms 
of elliptic integrals. In many respects the lemniscate is the most interesting of all 
such curves. However, the lemniscate deserves special study and we will study it in 
the next chapter. 


§3.1. Arcs of the ellipse and the hyperbola 

2 2 

The ellipse ^ — 1 can be given parametrically by the formulas x — 

a cost/?, y = bsinp. The differential dl of the length of an arc on the ellipse is 
equal to a/ dx 2 + dy 2 = dps/ a 2 cos 2 <p + b 2 sin 2 <p. If a = 1 and b = a/ 1 — fc 2 , then 
dl — dpy/l — k 2 sin 2 tp. In this case the length of the arc on the ellipse between the 
end point of the small half axis, B, and the point M = (cost/?, bsinp) is equal to 
(see Figure 24) 

E(p) — j \Jl — k 2 sin 2 ijj dip. 
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Therefore, the length of an arc on the ellipse can be expressed in terms of an elliptic 
integral of the second kind. This is precisely the reason why the integral gets the 
name elliptic. 

2 2 

The simplest parameterization of the hyperbola — pr — 1 is obtained using 
hyperbolic functions x ~ a cosh t, y — b sinh t. To express the length of an arc on 
the hyperbola in terms of F(p) and E(p) we, however, need a parameterization of 
the hyperbola by trigonometric functions. One such parameterization is given by 
the formulas 

x — - ; y — btanp. 

COS ip 

Under such a parameterization the differential of the arc length is equal to 


cos 2 p 


a 2 sin 2 p + b 2 dp 


and this formula does not lead to the expression desired. Therefore, let us consider 
another parameterization setting y = b 2 tan p. Then 



(1 - b 2 ) sin 2 p). 


In particular, if a? — 1 — b 2 — k 2 we get 


x 


cos p 


1 — k 2 sin 2 p and y = (1 — k 2 ) tan p 


so that 


dl 


(1 — k 2 )dp 


cos 2 - k 2 sin 2 p 



Figure 25 


Hence, the length of the arc AM on the hyperbola (Figure 25) is equal to 
W (1 - A: 2 )# _ r (1 - k 2 )dip 


r (</>) = 


'0 


cos 2 ip\/\ — k 2 sin 2 ip Jo cos 2 ip A(ip)’ 


where A(^>) = -\/l - k 2 sin 2 Since A'(ip) — - -- - S ^ f~- S - — ? it follows that 


(A (ip) tan ip)' 


k 2 sin 2 ip A (ip) 


A (ip) 
1 -k 2 


1 — k 2 

A (ip) 1 cos 2 ip cos 2 ipA(ip) A (ip) 


+ A (ip) 



§3.2. DIVISION OF ARCS OF THE ELLIPSE 


55 


so that 


F(p) = A{p)ta.np- A(ip)dip + (1 - k 2 ) 

= A(</?) tan ip — E(p) + (1 — k 2 )F(p). 

Thus, the arc length of the hyperbola can also be expressed in terms of the elliptic 
integrals E(p) and F(p) (and the elementary function A(92) tan p), although this 
expression is more cumbersome. 


§3.2. Division of arcs of the ellipse 

Let us consider the ellipse 


where a — 1 and b = Vl — k 2 . The length of its arc : BM (Figure 26) is equal to 
E(p). In §2.7 we showed that if F(p) + F(ip) = F(p), then 

E(p) + E(ip) — E ( n ,) = k 2 sin ip sin ijj sin n , 

where tp, 'll), and p are subject to the constraint 


cos (p cos ip — sin p sin ip y 1 — k 2 sin 2 p = cos p. 

We do not repeat all the formulas from §2.7, although we will need many of them 
now to obtain relations between the lengths of the ellipse arcs of the ellipse. 



Figure 26 

Let us start with the study of the case p — tt/2. In this case simp = 1 and the 
angles <p and ip are related by the formula 6 tan </? tan ip = 1. Let the points M and 
N correspond to the angles p and ip (see Figure 26). Denote the length of the arc 

BM by BM. Then 

E(p) = BM, E(ip) = BM.+ MN, E{p) — BM + MN + NA. 
Therefore, 

E(p) + E{ip) - E(p) = BM - NA. 

As a result, we get the following statement. 
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Theorem (Fagnano). Let the angles (p and ip be related by the formula 


bt&rupt&nip — 1, 

and let M and N be the points of the ellipse corresponding to these angles. Then 

S -\ /•-\ 

BM — NA = k 2 sin <p sin ip. 

(Recall that a — 1 and b = y/l — k 2 .) 

Let us consider in detail the case when the points M and N coincide, i.e., 
(p — ip = 9. Then tan 2 9 = b~ l and sin 2 0 = (1 + b ) -1 . Hence, BM — AM = 
k 2 sin 2 6 = 1 — b and BM + AM — E 1 — E(^) is the quarter of the arc length of 

the ellipse. Therefore, BM = | E 1 + |(1 - b) and AM — |J5 1 —1(1 — 6), i.e., up to 
an extra algebraic term |(1 — 6) the point M divides the arc by half. The presence 
of the extra term k 2 sin ip sin ip sin p in the addition theorem for elliptic integrals of 
the second kind leads to this, not quite natural, division “by half”. 

A similar division of the arc “by half’ can also be performed not only for the 
quarter of the arc of the ellipse but for an arbitrary arc '-'BM. If p = ip — 9 and 
F(p) + F(ip) = F{p), then 

2 E(Q) — E(p) — k 2 sin 2 6 sin p, 


where the angles 6 and p are related by the formula 

cos 2 0 - sin 2 9A(p) — cos p. 


To find the angle p from a given angle 9 (this problem is called the duplication of 
the arc ) we can make use of any of the formulas 


sin p = 


2 sin 6? cos (9 A (0) 
1 — k 2 sin 4 9 


or 



tan 9. 


To find the angle 9 starting with a given angle p (the division of the arc “by half” 
we can use the formula 


sin 2 9 = 


1 — COS p 

1 + A (p) 


) 


The equation 

2E(9) — E(p) ~ k 2 sin 2 9 sin p 


means that 

2 BM — BN = k 2 sin 2 9 sin p, 

where the points M and N correspond to the angles 9 and p. Hence, 


BM ^ -BN F -k 2 sin 2 
2 2 




Suppose we want to construct an arc '-'MN whose length should be equal to a 

half of AB (Figure 27). Then the problem of the division of the arc can be solved 
explicitly without an extra algebraic term. 

First, construct the point K that divides '-'AB “by half” and is expressed with 
an extra algebraic term (see above). For the angle 9 corresponding to K we have 

sin 2 9 = (1 + b)~ x and E(9) = i E 1 + ~(1 — b). 
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Figure 27 


For the angles p and ip the relation 

E(i>) - E(<f) = i E 1 

should be satisfied. Using the freedom in the choice M, let us impose on the angles 
ip and ip an additional constraint 

F(<p) + F{6)-F(iP) = 0 

from which it follows that 

E(ip) + E{9) — E{ip) = k 2 sin p sin ip sin 6 


and 

Then 


cos ip cos ip + sin p sin ip A ( 9) = cos 9. 

^ E 1 = E(ip ) — E(ip) = E{9) — k 2 sin <p sin ip sin 9 

Li 

— + ^(1 — b) — k 2 sin <p sin ip sin 9, 

z z 


so that 


k 2 sin p sin ip sin 9 


- 2 d-b). 


Taking into account that k 2 — 1 — b 2 and sin 2 9 — (1 + b) 1 , we get 
sin p sin ip 


(1 — 6 ) VTTb 


2(1 - b 2 ) 2 VlTb 2 


sin0. 


Since A(0) = \/b, the relation 

cos p cos ip + sin p sin ipA(9) = cos 9 
can be rewritten in the form 

1 

cos p cos ip + - sin 9\fb = cos 9. 
z 

But sin 9Vb = cos 9] hence, cos p cos ip — \ cos 9. As a result, we get equations for 
p and ip: 


1 1 

sin p sin ip = - sin 9 and cos p cos ip = - cos 9. 

Li Li 


1 7T / 7T\ 

cos (p ±ip) = - (cos 9 =F sin 9) — cos — cos ^ ± — J 


It follows that 
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These formulas enable us to find p and ip. Moreover, from the same equations we 
can also derive expressions 

sin ip — - V3 -I- 4 sin 9 + 2 sin 2 9 — ^ a/ 3 — 4 sin 0 + 2 sin 2 9, 

sin ip = - a/ 3 + 4 sin 6 + 2 sin 2 6 + - \/3 — 4 sin 6 1 + 2 sin 2 0. 

4 4 

These expressions show that given an ellipse in which the major and minor semi¬ 
axes OA and OB are drawn, then with a ruler and compass we can construct an 
arc of the ellipse whose length is equal to a half length of the arc AB. 

Should the semi-axes not be given, they still can be constructed with a ruler 
and compass. 

This can be done, for example, as follows. First, let us construct a pair of 

conjugate diameters of the ellipse. To this end we can use the fact that the 

midpoints of parallel chords of the ellipse lie on one diameter AA' of the ellipse, 
while the midpoints of chords parallel to A A' lie on the diameter BB' conjugate 
to A A'. Let 0 be the center of the ellipse. Let us draw the perpendicular from B 
on the straight line OA and on the perpendicular mark points P and Q such that 
BP — BQ — OA. Then the bisectors of the angles between the straight lines OP 
and OQ determine the principal axes of the ellipse we were looking for. 

Indeed, let us consider the coordinate system whose axes are directed along 
the principal axes of the ellipse. Then the coordinates of the points A and B are 
(a cos p, b sin p) and (asmp,—bcosp), respectively. Therefore, the coordinates of 
the points P and Q are ((a+ 6) sin</?, — (a + b) cos ip) and ((a — b)smp, (a —&) cosp ), 
respectively, It is clear now that the bisectors of the angles between the straight 
lines OP and OQ are the coordinates axes. 

Now, let us investigate the division of an arc of the ellipse into three equal 
parts. First of all, consider the division with an extra algebraic term. Recall that 
if F(ip) — 3 F{p), then 

Sx — 4(1 + k 2 )x 3 + 6k 2 x 5 — k 4 x 9 

S1H W = ----_ 

1 — 6k 2 x 4 + 4k 2 (l + k 2 )x 6 ~ 3k 4 x 8 ’ 
where x = sin p\ for ip ~ ~ this equation reduces to the equation 

(2.1) 1 — 2x + 2 k 2 x 3 — k 2 x 4 = 0. 

If F((p 2 ) = 2 F(p) and F(p 3 ) = 3 F(cp) (i.e., p% — ip), then 

3 E(ip) — E(ip ) = k 2 sin <£>(sin p sin p 2 + sin p 2 sin <^3) • 

In order to get rid of the extra algebraic term on the right-hand side and solve 
the problem of the division of an arc of the ellipse into three equal parts in the 
strict sense, let us again construct the arc with an arbitrary beginning point. Let 
us confine ourselves to the division of the arc AB into three equal parts, where OA 
and OB are the major and minor semi-axes, respectively. In this case ip ~ p 3 = f; 
hence, sin p% = 1 and 

3 E(p) — E(ip) = k 2 sin p sin p 2 (sin p + 1), 

i.e., 

E(p) — ^-E l + sin p sin p 2 (sin p + 1). 

O O 


(2.2) 
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Let x = sin ip be a root of equation (2.1). Let us find the angles ip and u> satisfying 
equations 

(2.3) E(u>) - E(i>) = ±E\ 

(2.4) F(ip) + F(ij,) - F(w) = 0. 

From (2.4) it follows that 

E(ip ) + E(ip) — E(u) = k 2 sin ip sin ip sin u. 

Taking (2.2) and (2.3) into account, we get 

(2.5) sin ip sin u = ^sin</? 2 (l + sin <p). 

O 

Moreover, it follows from (2.4) that 

cos ip cos u + sin ip sin uA(<p) = cos ip 

and, therefore, 


cosip cosu = cos^> — ^ sin</? 2 A(t/?)(l + sint^). 

O 


Making use of the relation 

cos p >2 cos + sin <p 2 sin <ps A(p) = cos ip, 


where ips = |, we obtain sin <p 2 — cos <p/ A{<p). Taking into account relation (2.1) 
with x — sin p, it is also possible to express this relation in the form cos y? 2 = 
1 — sin^; to this end we should express (2.1) as 


1- 


1 - x 2 
1 — k 2 x 2 


= {l-xf. 


Hence, 


cos ip cos lo — cost p — ~ cos<p(l + sin 9?) = cost^(l + cos<p 2 ). 

O o 


The last relation together with (2.5) leads to the following formulas: 


cos( , i p ± w) = - (cos + cos p cos ip 2 T sin p 2 sin <p sin ip 2 ) 
o 

= -x{cosip T sin</? 2 T cos {ip ± ip 2 )). 
o 


Thus, if a root of equation (2.1) can be constructed with a ruler and compass, 
then for any ellipse with the ratio of its axes equal to 1 : \/l — k 2 we can construct, 
using a ruler and compass, an arc whose length is equal to a third of the length of 
the arc ^ AB . An example of such an ellipse is an ellipse with the ratio of axes 
1 : \/2 for which k 2 = ~. Indeed, in this case equation (2.1) is of the form 

x 4 - 2x 3 + 4x - 2 = 0. 
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Making the change of variables x = \J\~z we get the equation 

2(1 + z)Vl^z = 2 - (1 - z) 2 . 

Squaring both sides we get 

z 4 + Gz 2 — 3 = 0, i.e, z 2 = -3±2v / 3. 

Finally, z = x/pp'l - 3 and sin p = x = yl — z, i.e., the angle -p can be constructed 
with a ruler and compass. 

Remark. The whole material of this section is borrowed from the classical 
treatise by Legendre [A12]. 


Problems 

3.2.1. Let the angles p and ijj be related by the formula b tan p tan ip = 1. By 

Fagnano’s theorem BM - NA = k 2 sin p sin ijj. Prove that this difference is also 
equal to the length of the segment MM\ as well as to that of NN \, where M\ and 
N\ are the projections of the center of the ellipse on the tangent lines at the points 
M and N (see Figure 28). 



Figure 28 


§3.3. Curves with elliptic arcs 

The ellipse is an example of a curve whose arc lengths can be expressed in 
terms of an elliptic integral of the second kind. For a curve whose arc lengths 
are expressed via elliptic integrals of the first kind the addition of arc lengths can 
also be performed even without an algebraic extra term. Are there curves whose 
coordinates are sufficiently simple functions of a parameter <p and whose arc length, 
as a function of ip, is an elliptic integral of the first kind? 

Consider the curve x — x{<p), y = y{p) such that the length of the arc from the 
point with parameter 0 to the point with parameter p is equal to 

,, r dp 

PM = / 7l vrrT z- 

Jo v 1 - k 2 sm p 


For k — such a curve was known already to Fagnano; this is the lemniscate. 
Legendre attempted to construct an example of such a curve for an arbitrary k. 


If 


dx — 


cos p dp 
1 ~ k 2 sin 2 p 


and dy 


—b sin p dp 
1 — k 2 sin 2 p ’ 
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where b — y/l — fc 2 , then 

dx 2 + dy 2 = ff . 2 = () , where A(y>) = \jl-k 2 sinV- 

1 — /c 2 sin 2 </? \ A(v?)y v 

Therefore, as the functions t(c/?) and y(<p) we may take primitives of the functions 

cos ip , —6sin</? 

-_ and - ; _.o — • 

1 - k 2 shr if 1 - k z sm ip 

The corresponding integrals are easy to calculate using the change of variables 
u = sin <p and v = cos ip. As a result, we get 

_ 1 1 + ku __ 1 + k sin ip 

X ~ 2k n 1 - ku 2k n 1 - fcsintp’ 


y 


It is easy to verify that 


1 kv 

— arctan — 
k b 


cos ky 


1 k cos ip 

— arctan ——-—. 
k b 


y/l — k 2 sin 2 <p 


and 

g kx _j_ g —kx 2 

cosh kx = -—-= — 7 = = t= 5 

2 y/l - k 2 sin 2 ip 

i.e., cosky — bcoshkx. 

Legendre found this example easily but he was not satisfied since the curve 


cos ky = b cosh kx 

is not an algebraic one. He also succeeded in constructing an example of an algebraic 
curve with the arc length equal to F(<p) (with a certain algebraic extra term). 

These studies of Legendre led mathematicians to the problem of finding all 
algebraic curves whose arc length is an elliptic integral of the first kind. Legendre s 
problem was solved by the Trench mathematician Joseph-Alfred Serret. In three 
papers published in Liouville’s Journal de l’Ecole Polytechnique he managed to 
construct a family of plane algebraic curves S p depending on a positive rational 
parameter p whose arc length is exactly F(<p). (For an exposition of these papers 
see [A14] , §§563-565.) Moreover, Serret proved that in this way one gets all the 
algebraic curves with required properties. 

Serret’s curves are obtained in the following way. Let p be a fixed number. 
Consider the triangle OPM with the length of the sides OP and PM equal to ^fp 
and \Jp + 1, respectively, and the angles at the vertices O and M equal to a and 
fj, respectively. Let 

(3.1) cosu = cos (pa — (p + l)/3). 

Let us introduce a Cartesian coordinate system Oxy with the origin at the vertex 
O of the triangle OPM and the angle between the T-axis and the side OM equal 
to u (Figure 29). 
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Figure 29 


Let us vary the triangle OPM so that the point O is fixed, the lengths of the 
sides OP and PM are constants and the angle u) between the cc-axis and the side 
OM is determined by relation (3.1) at all times. Then the point M will plot a 
certain curve S p . Let x = p cos to and y = p sin u be the coordinates of M; we may 
assume that x and y depend on a parameter a. 


Theorem (Serret). The length of an arc of S p as a function of the parameter 
a is equal to 

< v/1-lW Whm k = ^' 

The curve S p is algebraic for any positive rational p. 

Proof. Let OM — p. By the law of cosines p+l=p + p 2 — 2py / pcos a and 
p = p + l+p 2 - 2 py/p+ lcos/3, i.e., 


(3.2) 


cos a; = 


P 2 - 1 
2 P^P 


and 


cos /? = 


p 2 pl 

2 p\/p + 1 ’ 


Thus, p and cos cn are related by a (polynomial) constraint F(cos a, p) = 0, where 
F(x,y ) = 2 xy^fp — y 2 + 1. Moreover, if p € Q, then cos a; and cos pet are also 
connected by a polynomial relation. It follows that p and cospa are related by a 
polynomial dependence. Similarly, p and cos(p + 1 )(3 are related by a polynomial 
dependence. Since x = pcos uj = pcos(pa — (p + l)/3) and y = psince, it follows 
that x and p, and also y and p, are related by polynomial dependencies. Therefore, 
x and y are related by a polynomial dependence, i.e., the curve S p is an algebraic 
one. 


It follows from (3.2) that 


sin a = 


R 

2 Py/P 


and 


sin/? = 


R 

2 py/p + 1 ’ 


where 


R= V-p 4 + 2(2p + l)p 2 -l. 
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Differentiating the equality cos a; — cos (pa — (p + l)/3) we get 

— sinLodu = — sin(pa -- (p+ l)/3) (pda — (p+ 1) dp). 
Since sin a; = ±sin(pa — (p + l)/3), it follows that 

±du — pda - (p + 1 )dj3. 


Making use of the fact that 

o 2 — 1 


cos a 


and dcosa 


2 pjp 

ir~y 

±du = pda — (p + 1 )dfi — 


sin ada 


R 


2 P^/P 


.da 


we get da = Similarly, dp = Therefore, 

p 2 - ( 2 p + 1 ) dp 
i? P 


Let dl be the differential of the arc length. Then 

dl 2 = dp 2 + p 2 dco 2 = 4p{jp + l)~^f) 


i.e., dl = ±2 v / p(p+ 1)^. Since ^ = -2^+1$, it follows that ±dZ = y/p^- 
Moreover, sin/3 = ,^/^sina; hence, cos/3 - y/l-k 2 sin 2 a, where k = 

Finally, up to a sign, we get 


as was required. 


da 


dl ~' 3 yi-fcwT 


□ 


The curve S p possesses the following remarkable property. Let the points 0, 
A, and B correspond to the values of the parameters 0, a, and /?, respectively. 

Consider a point 0 such that 00 = OA + 05. In other words, 0 corresponds 
to the value of the parameter 7 such that F( 7 ) = 5(a) + 5(/3). Then cos 7 can 
be algebraically expressed in terms of cos a and cos /3. Moreover, it is also clear 
that the coordinates of the points A, B, and 0 can be algebraically expressed in 
terms of cos a, cos/3, and cos 7 , respectively. Hence, the coordinates of C can be 
algebraically expressed in terms of the coordinates of A and B. Thus, the addition 
and the division of arc lengths on the curve S p are algebraic problems. In particular, 
we may write an equation for the division of an arc (with the beginning point at 
0) of the curve S p into n parts of equal length. 

Now, let us consider in more detail the curve S p for p = 1. In this case 


cos a = 



and 


cos/3 = 


p 2 + 1 

2V2p ' 


Easy calculations show that 

x — pcos(a — 2/3) 


1 


4p 2 


+1 and y — 


R( 1 -p 2 ) 

4 p 2 


■■•sre 


J 
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where R = ^/— p 4 + 6p 2 — 1; hence, x — 1 = pi cos/3 and y = 

2 i 

pi — . Moreover, 


cos 2/3 = 

i.e., the curve is the lemniscate. 



—pi sin /3, where 


§3.4. Curves whose arc lengths can be 
expressed in terms of arc lengths of the circle 


In the preceding section we constructed a series of curves for which the addition 
of arc lengths is an algebraic operation. There are other examples of such curves. 
It was Euler who already constructed them. He wrote that he had found these 
curves only after long study. The coordinates of points on these curves are algebraic 
functions of the argument tan s, where s is a parameter proportional to the length 
of the arc of the curve. The addition of arc lengths of such curves is an algebraic 
operation since tan(a + (3) can be algebraically expressed in terms of tan a and 
tan /3. 

Later on, Serret generalized the examples constructed by Euler. Serret even 
gave a complete classification of such curves. We will confine ourselves to the 
simplest example. 

Let x + iy = » w ^ ere n is an integer or a rational number, and a and 

a are complex numbers such that a — a. As t varies from —oo to +oo, the points 
with coordinates (x , y) form a curve. This curve is an algebraic one, because x and 
y were algebraically expressed in terms of t. 

It is easy to verify that 


dx + idy 


(t — a) n+1 (pt + q) 
(i t-a) n +'(t + i) 3 


It is possible to select numbers a and a = a so that pt + q — k(t — i). We can even 
assume that |a| = 1, namely, 

y/n(n + 2) i 
n + 1 n + 1 
For this value of a we get pt + q — k(t — i), where 

2 ^/n(n + 2 ) 


a 


In this case 


Therefore, 


Hence, 


k 

dx — idy — k 
dl 2 = dx 2 + dy 2 — k 2 


n+ 1 

(t — a) n+1 (t + i) 
(t - a) n+1 (t - i) 3 

dt 2 


l = ±k 


(■t + i) 2 (t — i) 2 
1 dr 


dt. 


k 


dt 


t 2 + 1 


10 


r 2 T 1 


±k arctan t. 


Thus, if the parameter s is equal to £ of the arc length of the curve, then t — ± tan s 
and the coordinates x and y of the points of the curve can be algebraically expressed 
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in terms of t. The lengths of arcs are measured from the point corresponding to 
the parameter t — 0. 


Problems 


3.4.1. Prove that for n = 1 the equation of the curve considered above can be 
expressed (after a homothetic transformation) in polar coordinates in the form 



CHAPTER 4 


Abel’s Theorem on Division of Lemniscate 

A lemniscate is a curve whose equation in the polar coordinates is of the form 
r 2 — cos 2(9 (Figure 30). The name “lemniscate” stems from the Latin word lemnis- 
catus — decorated with ribbons. In Cartesian coordinates (x,y), where x = r cos 
y — r sin 9, the equation of this curve is 

(x 2 + y 2 ) 2 = x 2 -y 2 . 

Indeed, x 2 -f y 2 — r 2 and x 2 — y 2 — r 2 cos 26. 



Figure 30 

The French astronomer of Italian origin Jean-Dominique (Giovanni Dominico) 
Cassini (1625-1712) was the first to study the lemniscate. He considered even 
more general curves for whose points the product of the distances to the two fixed 
points Ft and F 2 is a constant (Figure 31). 



Figure 31 

On the basis of astronomical observations Cassini believed that with the help 
of such curves the movement of planets could be described more precisely than with 
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the help of ellipses. Now these curves are called Cassini’s ovals. But Cassini’s book 
Elements d’astronomie (Foundations of Astronomy), in which they were studied, 
was published in 1749, many years after his death. To the mathematical community 
the lemniscate became known through papers by J. Bernoulli and I. Bernoulli 
published in 1694 and, therefore, it is usually called Bernoulli’s lemniscate. 

The most remarkable properties of the lemniscate were discovered by an Italian 
mathematician Count Fagnano (1682-1766). By the way, it was Fagnano who 
coined the term elliptic integrals. Fagnano discovered that the arc length of the 
lemniscate can be expressed in terms of an elliptic integral of the first kind. He 
obtained an addition theorem for this integral and, therefore, demonstrated that 
the division of arcs of the lemniscate into n equal parts is an algebraic problem. 
In 1750 Fagnano published a collection of his papers under the name Produzioni 
matematiche. The Berlin Academy asked Leonard Euler to write a review of 
this book. Fagnano’s works stimulated Euler’s interest in elliptic integrals. In his 
numerous studies Euler considerably developed and generalized Fagnano’s methods 
and results. 

Already Fagnano knew that the division of the lemniscate could be reduced to 
a solution of an algebraic equation. However, methods for investigating solvability 
of equations in quadratures (i.e., via square roots) were not yet developed at that 
time. The first to achieve essential progress in this field was 19 year-old Gauss 
(in 1796). He found that a regular 17-gon can be constructed with a ruler and 
compass, i.e., the equation x 17 — 1 = 0 is solvable in square roots. Later, Gauss 
showed that with a ruler and compass one can construct a regular n-gon for any 
n of the form 2 a pi • ■ - pk, where p t are distinct Fermat primes, i.e., the primes of 
the form 2 2 ™ + 1. Gauss wrote that for all other n it is impossible to construct a 
regular n-gon with a ruler and compass but we have no substantiation of the claim 
that he could actually prove this. 

Gauss was also interested in the equation for the division of the lemniscate. For 
example, he showed that an equation of 25th degree related to the division of the 
lemniscate into 5 equal parts is solvable in square roots. His arguments were based 
on the fact that 5 can be represented as the product of 2 + % by 2 — i (for details 
see §4.3). Gauss did not publish these investigations but in his book Disquisitiones 
Arithmeticae (Arithmetical Studies), which appeared in 1801, he mentioned that 
the methods he developed were applicable not only to trigonometric functions but 
also to the functions related to the integrals of the form J . 

This claim intrigued Abel. Abel investigated in detail the equation for the 
division of the lemniscate and proved that the lemniscate can be divided into n equal 
parts for all numbers n of the form 2 a pi • • - pk, where the p t are distinct Fermat 
primes. Abel considered this theorem as one of his most important results. It is 
contained in the second part of his large work Recherches sur les fonctions elliptiques 
(Studies on Elliptic Functions). Abel’s proof is quite long and complicated. 

Later, Eisenstein (1823-1852) obtained a simpler proof. In doing so he dis¬ 
covered certain interesting properties of polynomials related to the division of the 
lemniscate. Eisenstein’s proof is still quite cumbersome. 

Relatively recently Rosen (see [C14]) found an elegant proof of Abel’s theorem. 
His proof is rather simple and, moreover, it distinctly demonstrates the decisive role 
played by the invariance of the period lattice of lemniscatic functions with respect 
to multiplication by i. 
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To help the reader to feel the style of Abel’s era and, at the same time, become 
acquainted with the modern interpretation of the subject, we will reproduce both 
proofs: the one that Eisenstein obtained one and a half centuries ago and the one 
recently found by Rosen. 

Abel only proved the possibility of the division of the lemniscate into n equal 
parts with a ruler and compass for the indicated values of n. He did not prove that 
for the other values of n this is impossible. In [C14] it is shown that for the other 
values of n it is impossible to construct the coordinates of the points that divide the 
lemniscate into n equal parts with a ruler and compass. This, however, does not 
mean that for the other values of n it is impossible to divide the lemniscate into n 
equal parts with a ruler and compass if the lemniscate is already drawn. Indeed, use 
of the lemniscate itself provides us with additional possibilities for constructions. 
Considering the points of intersection of the straight lines and circles with the 
lemniscate one can, in general, construct more than just quadratic irrationalities. 

* * * 


Before we plunge into the study of the equation for the division of the lemnis¬ 
cate, let us consider a simpler equation for the division of the circle. First, we will 
show how to solve in square roots the equation x 17 — 1 = 0 by a quite elementary 
method, though this solution cannot be generalized to the equation for the division 
of the lemniscate. Next, we will discuss the approach to the study of the solvability 
of the equation x n — 1 = 0 in square roots that can be generalized to the equation 
for the division of the lemniscate. 

§4.1. Construction of a regular 17-gon. 

An elementary approach 

The roots of the equation x n — 1 = 0 are the vertices of a regular n-gon. Indeed, 
if e = exp(27 ri/n), then s, e 2 ,... ,e n = 1 are the roots of this equation. Dividing 
the polynomial x n — 1 by x — 1 we get the polynomial x n ~ x + x n ~ 2 + • • • + x + 1. 
Thus, if the equation 

( 1 . 1 ) x n ~ 1 + x n ~ 2 + • • • + x + 1 — 0 

is solvable in square roots, then it is possible to construct a regular n-gon with a 
ruler and compass. 

For n = 3 there is no problem, since the quadratic x 2 + x + 1 = 0 is, without 
doubt, solvable in square roots. For n — 5 equation (1.1) is also easy to solve. 
Indeed, the substitution u — x + x~ l turns it into u 2 + u — 1 = 0. 

For n = 17 it is not that easy to solve equation (1.1) in square roots. To do 
so, Gauss used a special partition of the numbers e, s 2 , £ 3 , ... , e 16 into groups, 
where e = exp(27ri/17). To get such a partition, we enumerate the given numbers 
so that for a fixed l the root s k +i is obtained from e k in the same fashion, namely, 
by raising to a fixed power: e k+l = ( e k ) c : 

£k£l = £k+l- 

Such a numeration can be obtained by setting e k — e 9 ', where the residues of the 
numbers 1, g, g 2 , ... , <? 15 after the division by 17 take all values from 1 to 16. It is 
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easy to see that g = 3 possesses this property. For g = 3 the numbers e 0 , • ■ • , £15 
and their respective values are written one under another in the following table: 

£ £ 3 e 9 e 10 e 13 £ 5 e 15 e ll e 16 £ U £ 8 £ 7 £ 4 £ 12 £ 2 £ 6 

£q £i £2 £3 £4 £5 £6 £7 £8 £9 £10 £\1 £ 12 £13 £14 £15 

Let X\ be the sum of the numbers £k with even indices k, and X 2 the sum of 
the numbers £*, with odd indices k, i.e., 

X \ = £ + £ 9 -f £ 13 + c 15 + £ 16 + £ 8 + £ 4 + £ 2 , 

X 2 = £ 3 + £ 10 + £ 5 + £ n + £ 14 + £ 7 + £ 12 + £ 6 . 

The sum of all the roots of the equation x 17 - 1 = 0 (the root x — 1 included) 

is equal to zero, hence, x\ + X 2 = — 1. Simple calculations show that x\x 2 — —4. 
Indeed, let a = 2n/17. Then e k - cos ka + isinka-, hence, 

e + e 16 = 2cosa, £ 9 + £ 8 = 2 cos 8a, 

£ 13 + £ 4 = 2cos4o:, e 15 + £ 2 = 2 cos 2a, 

i.e., 

x\ = 2(cos a + cos 8a + cos 4a + cos 2a). 

Similarly, 

x 2 — 2 (cos 3a + cos 7a + cos 5a + cos 6a). 

Using the formula 

2 cospa cos qa = cos(p + q)a + cos (p — q)a 


we get 

X\X 2 = 8(cos a + cos 2a + cos 3a d-1- cos 8a) = 4(xi + x 2 ) — —4. 

Thus, we can find X\ and x 2 from the quadratic equation 
(1.2) x 2 + x — 4 = 0. 


Since 


cos a + cos 2a > 2 cos — = \Fi > — cos 8a 


and cos4a > 0, it follows that X\ > 0. Hence, X 2 — < 0, i.e., xi is the positive 

root of equation (1.2) and x 2 is the negative root. 

Denoting by yi, 1/3, y 2 and 1/4 the sums of the numbers £fc with indices whose 
residues modulo 4 are equal to 0, 1, 2 and 3, respectively, we get 

yi ~ e + £ 13 + e 16 + £ 4 = 2(cosa + cos4a), 
y 2 = e 9 + e 16 4- £ 8 + £ 2 = 2(cos8a + cos 2a), 
y 3 = £ 3 + e 5 4- e 14 4- £ 12 = 2(cos3a + cos 5a), 
y 4 = £ 10 + £ n + £ 7 4-£ 6 = 2(cos 7a + cos 6a). 


It is clear that y\ +y 2 — x\ and y\ > y 2 , because cos a > cos 2a and cos 4a > cos 8a. 
Moreover, 


y t y 2 = 4(cosa + cos4a)(cos8a + cos2a) = 2(cosa H-+ cos8a) — — 1. 

Therefore, y\ and y 2 satisfy the equation y 2 — xyy — 1 = 0. It is easy to verify that 
y 3 and 1/4 satisfy the equation y 2 — x 2 y — 1=0; moreover, y 3 > y 4 - 
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Finally, let us consider z\ — e + e 16 = 2 cos a and = e 13 + e A — 2 cos 4a, 
i.e., the sums of numbers Sk with indices whose residues after the division by 8 are 
equal to 0 and 4, respectively. Then z\ > Z 2 , Z\ + z<i — y\ and 

z\Z 2 = 4 cos a cos 4a — 2(cos 5a + cos 3a) = ys- 

Therefore, z\ is the largest root of the equation z 2 -y\z+y^ = 0 . Thus, the segment 
of length z\ — 2cos(27r/17) can be constructed with a ruler and compass. Now it 
is clear how to construct a regular 17-gon. 


§4.2. Construction of regular polygons. 

Elements of Galois theory 

In the preceding section we showed how to solve in square roots the equation 
x 17 — 1 = 0. Now we prove that for all numbers n of the form 2 n pi ■ ■ - pk, where 
the Pi are distinct Fermat primes, the equation x n - 1 = 0 is also solvable in square 
roots. Our exposition will be such that a good deal of it can be generalized to the 
case of the lemniscate almost without changes. 

Assigning to every real number t the point with coordinates (cost,sint), we get 
a parameterization of the unit circle C by real numbers. As a result, C turns into 
an abelian group with unit element ( 1 , 0 ). 

Since 


cos (t + s) = cos t cos s — sin t sin s and sin(t + s) = sin t cos s + cos t sin s, 
the law of addition of points on this circle can be expressed as follows: 

(a, b ) + (c, d) = (ac — bd , ad + be) = (/(a, 6, c, d),g{a, b, c, d)). 

It is easy to verify that 

2 (ar, y) = {x, y) + (x, y) = ( x 2 - y 2 , 2 xy) 


and 

3 {x,y) = (x 3 - 3xy 2 ,3x 2 y - y 3 ). 


Similarly, 


n(x,y) = {fn(x,y),g n {x,y)), 

where f n and g n are polynomials with integer coefficients, 
cos rup + % sin rup — (cos p + % sin p) n we get 


From the relation 


( 2 . 1 ) f n {x,y)- 


(x + iy) n + (t - iy) 1 


9n{x,y) 


(x + iy) n - (x - iy) 7 
2 i 


Let C n be the set of points (x, y) £ C such that n(x, y) = (1,0), i.e., f n (x, y) = 1 
and g n {x,y) = 0. These points can serve as vertices of a regular n-gon. It is also 
clear that C n is a subgroup of C isomorphic to Z/nZ, the additive group of residues 
modulo n. 

Over C, in addition to the points of C n there are other solutions of the system 


fn(x,y) = 1 , g n {x,y) = 0 . 

Let us find all these solutions. Using formulas (2.1) we can pass to an equivalent 
system of equations 




(x + iy) n = 1 , (x-iy) n = 1 . 
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Therefore, x + iy — e p and x — iy — e q for e = exp(27ri/n). In particular, x 2 +y 2 — 
s p e q ; hence, the equality x 2 +y 2 — 1 holds if and only if e p — £~ q . Thus, C n can 
be characterized as the set of all solutions of the system of equations 

(2.2) f n (x,y) = 1 , g n (x,y) = 0 , x 2 + y 2 = 1 . 

Let us consider the field K n generated over Q by the Cartesian coordinates 
of all the points of C n . For example, C 3 consists of points (1, 0) and (—1,±^); 
and, therefore, K 3 = Q(\/3); C 4 consists of the points (± 1 , 0 ) and ( 0 , ± 1 ); hence, 
K 4 =Q. 

Let cr be an automorphism of K n identical on Q. Since the coefficients of 
the polynomials f n and g n are integers, a sends any solution of system (2.2) in 
another solution of this system, i.e., a determines a permutation of points of C n . 
The automorphism a can be uniquely recovered from this permutation because the 
coordinates of points of C n generate the field K n . It is also clear that 

a ((a, b) + (c, d )) = <r(ac — bd , ad + be) = <r(a, b) + a(c, d), 

i.e., it is not an arbitrary permutation of points of C n that corresponds to a but 
an automorphism of the group Z/nZ. Therefore, the group G n of automorphisms 
of K n over Q is isomorphic to a subgroup of the group Aut(Z/nZ). 

The idea of the remaining part of the proof is as follows. First, we will show 
that if n = 2 a pi ■ ■ ■ Pk, where the pi are distinct Fermat primes, then the order of 
the group Aut(Z/nZ) is a power of 2. In particular, the order of G n is also a power 
of 2. 

Next, we will prove that if the order of the group G is equal to 2 k , then there 
exists a sequence of subgroups 

G — G° D G 1 D ■■■ D G k = {e} 

such that G l is a subgroup of of index 2 for i = 1 , ... , k. 

Finally, using this sequence of subgroups we will construct a sequence of qua¬ 
dratic field extensions beginning with Q and terminating with K n . The existence 
of such a sequence of extensions implies that all elements of K n (in particular, the 
coordinates of points of C n ) are quadratic irrationalities, i.e., can be constructed 
using a sequence of square roots. 

4.2.1. Lemma. The order of the group Aut(Z/rzZ) is equal to a power of 2 if 
and only if n = 2 a pi • ■ - pk, where the pi are distinct Fermat primes. 

Proof. As is known, all automorphisms of the additive group Z/nZ are of the 
form x 1 — mx, where m is a number relatively prime with n. Therefore, the order 
of Aut(Z/nZ) is equal to pin), where <p(n) is the number of positive integers that 
do not exceed n and are relatively prime with n. If numbers p and q are relatively 
prime, then ip(pq) — p(p)p(q). Indeed, let 0 < a < p - 1 and 0 < b < q — 1. Then 
the residues after the division ofn — pq numbers of the form aq + bp by n form the 
complete system of residues modulo n. 

To prove this, it suffices to observe that Gq q + b\p = a 2 q + & 2 P (mod pq) if and 
only if (a\ ~a 2 )q = (b 2 ~bi)p (mod pq), i.e., ai = <22 (mod p) and b\ = 62 (mod q). 
It is also clear that the numbers aq + bp and pq are relatively prime if and only if 
a and p and also b and q are relatively prime. 
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If n — p k , where p is a prime, then <p(n) = p k ~ 1 (p ~~ 1). Indeed, among the 
numbers that do not exceed n only the numbers p, 2 p, ... , p k ~ 1 p have a common 
divisor with n. The total number of such numbers is equal to p k ~ l . 

Let n = p kl ■ • -p 1 ^ 1 . Then p(n) is the product of the numbers — 1). 

The number p k ~ 1 (p — 1) can be a power of 2 only in the following two cases: 

a) p — 2 and k is an arbitrary positive integer; 

b) p — 1 = 2 C and k = 1. 

In the second case the number c cannot have odd divisors. Indeed, if d is an 
odd divisor of c, then 2 C + 1 = x d + 1 is divisible by x + 1. Hence, p is a prime of 
the form 2 2i + 1. □ 

4.2.2. Lemma. If the order of a group G is equal to 2 k , then there exists a 
sequence of subgroups 

G = G° D G 1 D ■■■ D G k — {e} 
such that G % is a subgroup of G 4-1 of index 2 for i = 1 ,... ,k. 

Proof. Let us apply induction on k. For k = 1 the statement is obvious. 
Suppose that the statement is proved for all groups of order 2 fe " 1 . For every element 
x G G consider the class of elements conjugate to it, i.e., the set [gxg~ l ] of elements 
of the form gxg ~ 1 , where g G G. Any two such classes either coincide or are disjoint, 
i.e., G is divided into the union of nonintersecting classes of conjugate elements. The 
equality g\xgf 1 = g-ixgf 1 is equivalent to the equality xh — hx, where h = gf l g\. 
Consider the subgroup 

G x = {h G G | xh — hx}. 

The elements g\xgf l and g^xgf 1 are equal if and only if g\ E gzG x . Therefore, the 
number of elements in the class \gxg ~ 1 ] is equal to the index of the subgroup G x 
in G\ hence, it is of the form 2 s . 

The class [ gxg ” 1 ] contains exactly one element only if x commutes with all 
elements of G, i.e., x is an element of the center of G. Suppose that the center of G 
consists of the unit element only. Then the sum of cardinalities of all the conjugacy 
classes is equal to 1 + 2 Sl + ■ • ■ + 2 s p , where s* > 1. Hence, this sum is an odd 
number. On the other hand, it is equal to the order of G. i.e., it is equal to 2 k . 
This contradiction implies that the center of G contains an element a ^ e. 

The element a generates a cyclic subgroup of order 2 r . Let us consider an 
element b = a m , where m = 2 r ~ 1 . The element b generates a subgroup H of order 
2 that belongs to the center of G; in particular, II is a normal subgroup. By the 
induction hypothesis for the group F = G/H of order 2 fc ~ 1 , there exists a sequence 

F = F°Df l1 d • • • D F^ 1 = {e}, 

where F' 1 is a subgroup of F 1 ^ 1 of index 2 for i = 1, ... , k — 1. To get the required 
sequence of subgroups, set G k = H and G l — F l U bF l for i = 0, ... , k — 1. □ 

Let n — 2 a pi ■ • -p TO) where the p t are distinct Fermat primes. Then the group 
G n of automorphisms of the field K n over Q has a sequence of subgroups 

G n = G° d G 1 D • • • D G k = {e}, 

where G l is a subgroup of G l ~ l of index 2 for % — 1, ... , k. To the subgroup G x we 
assign the set L % consisting of the elements of K n that are fixed under the action 
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of all the automorphisms in G l . Since the sum, difference, product, and ratio of 
elements from IS belong to IS, it follows that IS is a field. It is clear that L k = K n 
and IS D IS* 1 . 

We can show that L° = Q, i.e., for any x G K n \Q there exists an automorphism 
of the field K n over Q that moves x. In general, not every extension of Q possesses 
such a property. For example, the field 

{p + q\/2 + r\f& | p, q, r G Q} 

has no automorphisms distinct from the identity one. Indeed, the element \/2 can 
only pass into a root of the equation rr 3 — 2 — 0 , but only one root of this equation 
belongs to the field considered. 

The reasons why L° = Q will be given a little later. For the moment let us 
assume this without proof. 

All automorphisms in G l preserve the elements of L l . Moreover, G 1 * 1 — G % U 
aG\ where a G G 1-1 . Hence, a 2 G G l because a 2 cannot belong to aG l . Therefore, 
the automorphism a of K n is such that if x E L ®, then a 2 x — x. Moreover, ax = x 
if and only if x 6 L 1 * 1 . Any element x € L l can be represented as the sum of 
elements X\ = \(x + ax) and x^ = \{x — ax), where ax\ = x\ and ax 2 = — x^- 
Therefore, x\ € IS* 1 C L l and X 2 — x — x\ 6 L % . 

Suppose that L l 7 ^ L l ~ l . Let a € L l \ L x ~ l and a — aa — a. Then aa — —a, 
where a / 0. Moreover, a(ax 2 ) — (—cx)(—X 2 ) = olx 2 , i.e., ax 2 € L 1-1 and x 2 £ 
a~ 1 L i ~ 1 . Therefore, iJ — L^ 1 + a~ 1 L l ~ 1 . Hence, if x G L\ then the elements 1, 
x, x 2 are linearly dependent over M -1 ; therefore, x 2 +px+q — 0, where p, q G L 1 " 1 . 
It follows that any element from K n is a quadratic irrationality over L°. 

Now let us prove that L° — O. First, observe that K n is an algebraic extension 
of Q, i.e., the coordinates of points of C n are algebraic numbers. Indeed, the 
solutions of the system of equations 

f n (x,y) = 1 , g n {x,y) = 0 

are algebraic; in order to prove this, it suffices to consider f n (x,y) — 1 and g n {x, y) 
as polynomials of y and examine their resultant. 

Let Q be the set of all algebraic numbers (over Q). It is easy to verify that 
Q is a field. Indeed, let a, (3 G Q. Then a p = a,Q + a\a + • • • + a p _ia p_1 and 
= b 0 + bi/3 + • • • + b q -i(3 q ~ l , where a,i, bj G Q and a 0 b 0 ^ 0. Therefore, any 
element of the ring generated over Q by elements a and (3 can be represented in 
the form of a linear combination with rational coefficients of elements a 1 [P, where 
0 < i < p and 0 < j < q. In particular, each of the elements 1, a + /3, ... , (a+0) pq 
can be represented as a linear combination of the pq elements indicated; hence, they 
are linearly dependent over Q, i.e., a + f3 gQ. 

We similarly prove that a/3 G Q. Moreover, a,Qa -1 = a p_1 — a\ - a p -\aP~ 2 \ 

hence, a -1 G Q. 

Any automorphism r of the field Q over Q sends K n into itself. Indeed, the 
field K n is generated by coordinates of points of C n and C n coincides with the set 
of all solutions (over C) of the system of equations 


fn{x,y) = 1 , g n (x,y)^ 0, x 2 + y 2 = l. 
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The equality L° = <Q> means that 

if a e K n \Q, then there exists an automorphism a : K n —> K n for which 
cr(a) ^ a. 

To prove this, it suffices to indicate an automorphism r : Q —> Q for which 
r(a) ^ a. 

4.2.3. Theorem. Let a and b be two roots of an irreducible polynomial over 
Q. Then there exists an automorphism r : Q —> Q such that r(a) = b. 

Proof. Let K be a .field, a a root of an irreducible polynomial P over K, and 
K (a) the field generated by a over K. Then an arbitrary isomorphism / : K —> K' 
can be extended to an isomorphism g : K(a) —> K'{j3 ), where (3 is a root of the 
polynomial f(P). Indeed, the field K(a ) consists of the elements of the form 
where j > 0 and kj G K. Set g{Yfkjd j ) = This map is 

well defined because the equality J2 k j aj — 0 is equivalent to the fact that the 
polynomial F = kj xP is divisible by P. 

Let us first construct an isomorphism t\ : Q(a) —» Q( 6 ). Then select an 
element t 2 G Q \ Q(a). It is a root of an irreducible polynomial P 2 over Q(a). Let 
t 2 be a root of the polynomial 7 i(P 2 ). 

Next, we can construct an isomorphism r 2 : Q(a, t 2 ) —> Q(b, t' 2 ). Select an ele¬ 
ment t 3 € Q\Q(a,fo) and construct an isomorphism r 3 : Q(a,i 2 ,fo) —+ Q(b, t' 2 ,t' 3 ), 
etc. Since the dimension of Q over Q is countable, we can construct a basis 
{l,£i = a,£ 2 , £ 3 ,...} of Q over Q. The elements t 2 , ... can be chosen so 

that the field Q(a,i 2 , • • • ,tk) contains a subspace generated by elements 1 , £ 1 , ... , 
£k ■ 

As a result, we get a monomorphism r : Q ■—Q such that r(a) = b. It 
remains to verify that r is an epimorphism. Let 71 G Q be a root of an irreducible 
polynomial R over Q and let 71 , ... , 7 n be all the roots of this polynomial. Then 
T ili) e { 71 , ■ • •, 7 n}> where all the numbers r( 7 i),... ,r( 7 n ) are distinct. Hence, 
the sets { 71 ,... , 7 n } and (t(7i), ... ,t( 7 n )} coincide. In particular, 71 = t(tj) for 
some j. □ 

It is possible to significantly generalize Theorem 4.2.3. First observe that <Q> is 

algebraically closed. Indeed, let xq be a root of the polynomial a+/3x-\ - \-ux n — 0, 

where a, fd ,..., u> G Q. Let us consider a polynomial R irreducible over Q and with 
a root a. Let or, ... , a p be all the roots of R. Then all elementary symmetric 
polynomials of aq, ... , a p can be expressed in terms of the coefficients of R and, 
therefore, are rational. Let us similarly define Pi, ... , fd q ] ... ; 07 , ... , ui r and 
consider the polynomial 

P(x) = J"J (Cli+PjXd - \-UkX n ). 

This polynomial is nonzero and all its coefficients can be expressed in terms of the 
elementary symmetric polynomials cr s (a 1 ,..., a p ), ..., a t (u)i,... ,u> r )-, hence, its 
coefficients are rational. Since P(xq) = 0 , it follows that x 0 G Q. 

In the general case the following statement holds for the automorphisms of an 
algebraically closed field fl over its subfield K. 

4.2.4. Theorem. If the elements x,y G Ll are transcendental over K, then 
there exists an automorphism of over K that sends x into y. If the elements 
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x,y € are roots of the same irreducible polynomial over K, then there exists an 
automorphism of over K that sends x into y. 

We will not prove this theorem for arbitrary fields (for the proof see [B3], Ch. 
IV, §6, Prop. 3), but for the most interesting case — the automorphisms of C 
over <Q) — we will prove not only this theorem, but several of its generalizations. 
For example, we will prove that the cardinality of the set of automorphisms of C 
coincides with the cardinality of the set of all maps C —> C (i.e., it is greater than 
the cardinality of C). Our exposition follows [C15, Cl8]). 

First, observe that a field isomorphism <p : F —> G can be extended to an 
isomorphism <p' : F(a) —» G(j3) if and only if the following conditions hold: 

1) if an element a is algebraic over F and P is an irreducible polynomial over 
F with root a, then (3 is a root of the polynomial <p(P)‘, 

2) if a is transcendental over F, then (3 is transcendental over G. 

For our arguments we will need Zorn’s lemma. The point is that proofs by 
induction are only applicable to countable sets whereas the dimension of C over 
Q is uncountable. Therefore, to work with automorphisms of C over Q we need 
another technique and Zorn’s lemma is sufficiently convenient for this purpose. 

Before we formulate Zorn’s lemma, let us give several definitions. Let g be a 
set. Denote by 2 9 the set of all the subsets of g. A set A C 2 9 is called a chain 
if for any pair of its elements a, b G A either a C b or b C a (recall that a and b 
are subsets of the same set g). A set B C 2 9 is called Zorn closed if for any chain 
A C B the set B contains also the union of all the elements of A. An element 
m G B is called maximal if the set m C g is not contained in any other subset a C g 
which is an element of B (i.e., a € B). 

4.2.5. Zorn’s lemma. Every nonempty Zorn closed set B C 2 9 contains at 
least one maximal element m. 

With the help of Zorn’s lemma we can extend any automorphism of a subfield 
of C to an automorphism of the whole field C. For that, we have to apply Zorn’s 
lemma to the family of automorphisms that extend <p. But there is one difficulty. 
It might happen that an extension of an automorphism of the field F to the field F' 
containing F does not leave F' invariant, so that an extension of an automorphism 
is not an automorphism of F' but an isomorphism of F' with some other field. 
For example, consider the automorphism of the field Q(\/2) given by the formula 
a + by/ 2V-> a — by/2. Its extension to Q( \/2) is as follows: 

a + by/2 + cV2 + dy/% >-> a + iby/2 — cy/2 — idy/%. 

This map is an isomorphism of Q(^2) to Q(z</2) but not an automorphism of 
Q(v / 2). 

To overcome this difficulty let us first prove the following statement. 

4.2.6. Theorem. Any field isomorphism : F —> G can be extended to an 
isomorphism F —> G of algebraic closures. 

PROOF. Consider all possible extensions of the isomorphism <p to an isomor¬ 
phism (p a : F a —> G a , where F a C F and, therefore, G a C G. Let us show that 
the set 


S ~ {the subsets of F x G of the form {(a, <p a (a)) | a € F a }} 
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is Zorn closed. Consider an arbitrary chain in S. By the definition of a chain, for 
any two of its elements the corresponding isomorphisms ip a and pp are such that 
one of these isomorphisms is an extension of the other. This means that to the 
union of all the elements of the chain an isomorphism corresponds, i.e., their union 
belongs to the set considered. 

By Zorn’s lemma the set S has a maximal element. To this element there 
corresponds an isomorphism ip : F' —*• G', so we must prove that F' — F and 
G' = G. Suppose that an element a <E F does not belong to F'. But a is algebraic 
over F' and G is algebraically closed. Therefore, G contains an element b which is 
the root of the image (under ip) of the minimal polynomial of a over F'. Hence, it 
is possible to extend this isomorphism ip : F' —> G' to an isomorphism F'(a) —> 
G'(b ), but this contradicts the maximality of the element corresponding to ip. 

Thus, F' = F. It remains to prove that G' = G. The field G' is isomorphic 
to F; hence, G' itself is algebraically closed. In addition, G' contains G. Hence, 
G' = G. □ 

Now we can prove the theorem on extension of automorphisms of subfields of 
C. 

4.2.7. Theorem. Any automorphism p of a subfield in C can be extended to 
an automorphism of the C. 

PROOF. Consider all possible extensions of the given automorphism (p : F —> 
F to automorphisms p a : F a — > F a , where F a C C. As in the proof of Theorem 
4.2.6, we see that the set consisting of sets of the form {(a, p a (a)) | a e F a } has a 
maximal element. To this element there corresponds an automorphism ip' : F' —> 
F'. We must prove that F' — C. 

Suppose a complex number a does not belong to F'. If a is algebraic over F', 
then by Theorem 4.2.6 there exists an extension of p' to an automorphism of the 
algebraic closure of F' and the latter is strictly bigger than F'. If a is transcendental 
over F', then there exists an extension of ip' to an isomorphism F'(a) —> F'(a) 
sending a to a. In both cases we get a contradiction with the maximality of F'. 
Hence, F' = C. □ 

Observe that it is not always possible to extend an isomorphism of two subfields 
of C to an automorphism of C. For example, there exists an isomorphism C —> 
F C C, where F^€. Such an isomorphism is constructed as follows. Let ai, a 2 ,••■ 
be a countable set of complex numbers algebraically independent over Q. The map 
ai > adetermines an isomorphism 

Q(ai,a 2 ,...) •—-*■ Q(a 2 ,a 3 ,...) c Q(a 1 ,a 2 ,...). 

Consider all possible extensions of this isomorphism to isomorphisms 9 a : F a —» 
G a ( F a ,G a C C) such that a\ is transcendental over G a . 

By Zorn’s lemma the set {F a } has a maximal element which, as is easy to show, 
coincides with C. Therefore, we get an isomorphism C —> F C C, where the field 
F does not contain a\ and, therefore, F C. 

The proof of Theorem 4.2.4 for the case of automorphisms of C over Q is not 
a problem now. Indeed, if the complex numbers x and y are transcendental, we 
can consider an automorphism of the field Q(x,y) that interchanges x with y. By 
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Theorem 4.2.7 it is possible to extend this automorphism to an automorphism of 
the field C. If x and y are roots of the same irreducible polynomial over Q, then 
there exists a field isomorphism Q(x) —* Q(y) sending x to y. By Theorem 4.2.6 
this isomorphism can be extended to an isomorphism of the algebraic closures of 
Q(x) and Q (y). But the algebraic closures of these fields coincide and, therefore, 
we get not just an isomorphism, but an automorphism. This automorphism can be 
extended to an automorphism of C. 

It is crystal clear now that the cardinality of the set of automorphisms of C is 
not less than the cardinality of the continuum. It turns out that the cardinality of 
the set of all automorphisms of C is, actually, greater than the cardinality of the 
continuum. 

4.2.8. Theorem. The cardinality of the set of all automorphisms of C coin¬ 
cides with the cardinality of all maps € —> C. 

Proof. We have to prove that the cardinality of the set of all automorphisms 
of C coincides with the cardinality of the continuum. It suffices to prove that the 
cardinality of the set of all the automorphisms of C is not less than the cardinality 
of the set of all the subsets of the continuum. A set B C C is called a basis of 
transcendentality over Q if B is algebraically independent over Q and B is not 
contained in any other set of complex numbers algebraically independent over Q. 
The maximality of B implies that C is algebraic over Q(J3).' Therefore, in particular, 
the cardinality of B is equal to that of the continuum. 

Let us show that to any subset S C B we can assign an automorphism c ps of 
C such that to different sets there correspond different automorphisms. 

Consider an automorphism of Q(B) over Q that sends x £ B to x if x £ S 
and to — x if x S. By Theorem 4.2.7 this automorphism can be extended to an 
automorphism <ps of the whole C. Clearly, if x belongs to one of the sets S or T and 
does not belong to the other set, then (fs {%) = — Tt(%) and, therefore, (ps ^ <Pt-0 


Problems 

4.2.1. Prove that any automorphism of the field M is the identity one. 

Hint. Any automorphism a of R should preserve the elements of Q. Moreover, 
the inequality x > y is equivalent to the fact that x — y — c? for a, G R. 

4.2.2. Let / be an irreducible polynomial over Q and cos(2fc7r/n) be a root. 
Prove that all the roots of / are real ones. 

Hint. Any automorphism of Q over Q preserves the field K n C M. (For the 
definition of K n see the beginning of §4.2.) 

§4.3. The equation for the division of the lemniscate 

Let us compute the arc length of the lemniscate r 2 — cos 2 0. If x — r cos 9 and 
y = r sin 9, then 

dx 2 + dy 2 — (cos Odr — s’mOrdO) 2 + (sinOdr A cos Ord6) 2 = dr 2 + r 2 d0 2 . 

Therefore, the differential ds of the arc length can be computed in polar coordinates 
by the formula ds 2 — dr 2 + r 2 d0 2 . 



§4.3. THE EQUATION FOR THE DIVISION OF THE LEMNISCATE 


79 


In our case 2 r dr — —2 sin 0 d6. Therefore, 


dr 2 + r 2 dd 2 = dr 2 + 


r 4 dr 2 


1 — cos 2 29 


dr 2 

1 — r 4 ' 


Hence, 




dx 

sj (1 — x 2 ){l — k 2 x 2 ) ’ 


where k 2 — — 1. 


Thus, r = sn s is the elliptic Jacobi sine function for k 2 — —1. It is true that one 
does not usually consider the Jacobi sine with such a modulus k\ still, the addition 
theorem is, actually, true in this case as well: 


sn(u + v) 


sn u cn v dn v -f sn v cn u dn u 
1 + sn 2 u sn 2 v 


where cn u — \/l — sn 2 u and dn u = y/l + sn 2 u. 
In what follows we set 


ip(s) = sn s 


for k 2 = — 1 . 


The relation 


ds — 


dr 


dip(s) 


VT 


4 a / 1 ~<P 4 (s) 

means that — y/\ — ip 4 . Therefore, the addition theorem can be expressed in 

_ <p(u)if/{v) + <p{v)y'(u) 

”b J -i , 2 / \ 2 ( \ 

1 + (fi 2 (u)ip 2 {v) 

The function possesses a very important property which we will repeatedly 


the form 


use: 


Indeed, setting x = iy we get 


i p{iu) — i<p(u). 


dx 


dy 


VT 


x H 


yr 


r 


Hence, if 


u = 



dy 

V 1 ~y^ 


then r = tp(u) and ir = ip(iu). The relation <p(m) = iip(u) implies that < p'{iu) = 
<p(u) and ip'(—u) — <p(u). 

The relation ip(iu) = i<p(u) means that the lattice of periods of the function <p 
turns into itself under the map u i—> iu. Let us find out what the lattice of periods 
looks like. Let 

u _ f 1 dx 
2 Jo VT - x 4 

Then the length of the lemniscate is equal to 2u>. The number u> plays the same 
role for the lemniscate as the number tt plays for the circle. 
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By definition, <p(u/ 2) = 1 and <p'(u/2) = 0. The properties of the function 
sn imply that <p(u + u) = —<p(u). Hence, ip(u + iu) — —<£>(«) and, therefore, 
<p(u + u±iu) = <p(u), i.e., tu(l ± i) are periods of ip. 

Let us now find the zeros and poles of tp. Let a, 0 € K. Then 


<p(a + i/5) 


cp(a)ip'(P) + i(p'{a)f3(P) 
1 — ip 2 (a)p‘ 2 ((3) 


where all the quantities in the numerator and the denominator are finite. Hence, the 
equation y?(a + i/5) = 0 holds only when ip(a)<p'(P) = ip'(a)(3({3) = 0. The real zeros 
of the functions <p and <p' are, respectively, of the form mu and (to + \ )u, where 
to G Z. Hence, the zeros of ip should be of the form mu+niu or (m+^)u+(n+^)iu. 
Clearly, <p(mu + nio;) — 0. The equation 


<P 






<p'(u) i(p'{u) 

1 + <p 2 (u) 1 ~~ <p 2 (u) 


shows that if ip(u + f) = 0, then ip(u + ^) = oo. Hence, (to + | )u + (n + \)iu are 
the poles of the function <p. The system of zeros and poles of <p is plotted on Figure 
32 (the poles are denoted by crosses); the parallelogram formed by the periods is 
shaded. 



Figure 32 

Let us return to the problem of dividing the lemniscate into equal parts. Let 
the length of the arc of the lemniscate between the origin and the point (r, 9) be 
equal to s. Then r — <p(s). The length of one petal of the lemniscate is equal to u 
and, therefore, the points for which s = ku/n, where 0 < k < n, divide it into n 
equal parts (Figure 33). 
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Figure 33 


To construct these points, it suffices to construct segments of length r = ip(a), 
where a = ku/n. Indeed, since cos 29 = r 2 , it is possible to construct the point 
(r, 9) with a ruler and compass if the segment r is known. Since na = ku>, it 
follows that <p(na ) = 0. The addition theorem for the function <p enables us to 
express p(na ) in terms of p(a), i.e., <p(na) = F n (ip), where p = p(a) and F n is an 
algebraic function. The problem of the division of the lemniscate reduces then to 
that of solving the equation F n (p) — 0, and the division of the arc of the lemniscate 
between the origin and the point (r, 9) into n equal parts reduces to the problem 
of solving the equation F n (p) = r. % 

Let us show that the equation F 2 {p) = r can be solved in square roots, i.e., 
that the arc of the lemniscate can be divided in half with a ruler and compass. By 
the addition theorem 


F 2 (p) - (p(2a) - 


2 (p(a)<p'(a) 
1 + ip 4 (a) 


2y>\/l - F 4 
l+ <p 4 


Squaring the equation 2p>\Jl — </? 4 = r(l + <p A ) and setting x 


<p 2 — <p 2 we get 


x 2 + 4r 2 x + 4 = 0. 


We define odd complex numbers to be numbers of the form a + ib, where a and b 
are integers such that a + b is odd. The function F n (<p) is, generally, algebraic. But 
if n is an odd complex number, then this function is actually rational. To verify 
this, observe first that 

ib — <p(a ± ia) = (1 ± i) . 


and 

<p'(±a ± ia) — y/l- 

(In the latter formula the sign of the root is determined by the condition </?'(()) = 1.) 
Moreover, adding together the expressions for ip(x + y) and <p{x — y) we get 


p{x + y) + p{x ~y) — 


2<p{x)<p'(y) 

1 + ip 2 (x)ip 2 (y)' 


Let y — ±a ± ia. Then 


2<p(x)(l + p 4 ) 

1 — p 4 ± 2i<p 2 (x)(p 2 


ip{x + y) + <p(x -y) = 
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Therefore, if p(x) and p(x — y ) are rational functions of p, then p(x + y) is also a 
rational function of p — p(a). For instance, setting x — a and y — a ± ia, we get 

. s — ip 4 + (2 + i) . . ip 4 + (2 — i) 

<p(2a + tot) = ip - T — /T p(2a-ia)=p 


1 — (1 - 2 i)p 4 ' ' 1 — (1 + 2i)p 4 ’ 

Similar arguments show that if a + b is an odd complex number, then 

fV) 


(3.1) 


p((a + bi)a ) = ip 


Q(p 4 Y 


where P and Q are polynomials with coefficients from the ring Z [i] = {p + qi | p, q £ 
Z}. If b = 0 in (3.1), then the coefficients of P and Q are integers. 

The polynomials P and Q possess certain interesting properties. But before 
we start discussing these properties in the general case, let us compute p(3a) and 
p( 5a). Since 

. . . 2p(2 a)p'(a) 

p(2a + a) + p(2a — a) — 


it follows that 


p( 3a) = -<P 


1 + p 2 (2a)p 2 (a) ’ 
p 8 + 6 v? 4 — 3 


1 + 6p 4 — 3 p 8 

Therefore, the division of the lemniscate into three equal parts is reduced to solving 
the equation p 8 + 6c/3 4 —3 = 0. Clearly, this equation is solvable in square roots. 
To compute p(5a), we can make use of the fact that 


p(3a + 2a) + p(3 a — 2a) 


2<p(3a)<p' (2a) 


1 + </? 2 (3a;)</? 2 (2ai) 

Simple but cumbersome calculations show that 

p 24 + 50</? 20 — 125</? 16 + 300</? 12 — 105</? 8 — 62</? 4 + 5 


(3.2) p(5a) = p 


1 + 50</? 4 - I23p 8 + 300 p 12 - 105 p l& - 62p 20 + 5^ 24 ' 


To solve the equation F$(p) = 0 in square roots, Gauss used the fact that over Z[i\ 
the number 5 factors into the product of 2 + i and 2 — i. Let 0 = (2 + i)a and 
0 — (2 — i)a. Then 

m\ — ip 4 + (2 + i) — ip 4 + (2 — i) — 

= P ■ 1- T, -= & <P(0) = P ■ + 


p(5a) — ip 


1 - (1 - 2 i)p 4 

iip 4 + (2 — i) 

1 — (1 + 2i)ip 4 ’ 


1- (1 + 2 i)p 4 

^ (5a) = ^. zg 4 + .. g± j) 

1 — (1 — 2 i)ip 


4 • 


Observe that the numerator in (3.2) is divisible by the numerators of the fractions 
ip and ip. Dividing the numerator of (3.2) by 

(-ip 4 + (2 + i))(ip 4 + (2 - i)) = p 8 — 2p 4 + 5, 

we get the polynomial 


p 16 + 52p 12 - 26p 8 - I2p 4 + 1. 
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The solution of 5a) = 0 (if we disregard the obvious case ■0 = 0) can be 
obtained by, first, solving the equation iip 4 + (2 — i) — 0 and, next, solving the 
equation 


(3.3) 


— + (2 + l) 4/ r.- r, ■ 

¥>4- Ti -oJvTT = = Vl + 21. 


1 - (1 - 2i)<p 4 


,-r 4 


One can, alternatively, solve the equation —+ (2 + i) =0 and then the equation 


(3.4) 




iip 4 + (2 — i) 

1 - (1 + 2 i)p 4 


= 1})= \fl - 2 i. 


Dividing (3.3) by (3.4) we get a quadratic equation for ip 4 . 

Now we prove certain properties of the polynomials P and Q for arbitrary odd 
complex numbers m = a + bi. Any such number m is, up to a summand of the 
form 2(±1 ± i), equal to either ±1 or ±i, i.e., is equal to i e . 


4.3.1. Theorem. p(ma) - i £ p 


ip- 1 


(p y * + A\(p p +-h A(p_!)/4 


1 + A\p 4 4-b 

p(a) and p — a 2 +b 2 is the square of the norm of m. 

Proof. Let us make use of the relation 

f i £ u)\ ( i e+1 u;\ . . „. 

+ ~2~) ^ V U + ~2~) = _1 *' 

Set x = p(u + £) and y = <p{u + y). Then xy = i and 


y, where p — 


i £ u 


i £+1 co' 


<p I mu + —y- J p I mu + ^ 


(-l) £ b 


i.e. 


x m v m _ , m 

Q(.x) y Q(y) 1 } ' 


Hence, 


P(x) P(ix - 1 ) 


(—l) e . Moreover, P(ix *) = P(x *) and Q{ix x ) 


-i\ _ 


Q(x) Q(ix- X ) 

Q{x~ x ), since P and Q only depend on x 4 . Therefore, both P and Q are of the 
same degree r and Q{x) — A x r P(x~~ x ). We can find the coefficient A using the fact 
that p(~ ) = 1 and p(yy) = i £ ■ Indeed, this means that P — y p j y , i.e., A = i~ e . 
It remains to demonstrate that r = a? + b 2 — 1. Under the transformation 


£ i- 


Z 

m 


the lattice generated by t o and ito passes into the lattice of zeros of the 
function p(ma). Therefore, the zeros of p(ma) form a lattice with the area of 
the fundamental parallelogram equal to uj 2 \m\~ 2 . The area of the parallelogram of 
periods of the function p(a) is equal to 2cc 2 ; hence, it contains 2a; 2 : o/ 2 |m| -2 — 
2jm| 2 zeros of the function p(ma) = Since the degrees of P and Q are equal, 

it follows that p(ma) — 0 holds if and only if either ip = 0 or P(p) = 0. Each 
equation p(ct) = a for various a has precisely two solutions inside the parallelogram 

of periods and, therefore, the degree of P is equal to 


2|m[ 


ml 


1. 


□ 


The roots of the equation P(p) = 0 for odd complex m can be described as 
follows. Let O = o;(l — %) and O' = o;(l + i ) be the periods of p{a). The equation 
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<p(ma) — 0 implies that 

ma — (a + bi)ui — pQ, + p'Ll', 

where p — s ^, p' = Since = zfi, it follows that ma — (p-j-ip')Cl. Denoting 
the complex number p + ip 1 £ Z[i] by v we get ma — iAt, i.e., a = Let v 
run over the complete system of residues modulo m except the origin. Then the 
numbers p{~~) are the roots of equation P(<p) = 0. All these roots are distinct. 
Indeed, the equality 

4 —) = A— 

\m ) \ m 

is possible in two cases only: either when v = v' (mod m) (which is excluded) or 
when 2 (u + u r ) = (1 +i)(l — 2 iu)m, which is also impossible because the right-hand 
side is only divisible by 1 + i whereas the left-hand side is only divisible by 2. Thus, 
all the roots of equation P(p) — 0 are of the form 



where v runs over the complete system of residues modulo m without the zero 
residue. 

The example of calculations of ip(ma) for m — 5 already demonstrates that the 
arithmetic properties of m over the ring Z[i\ are more important than those over 
Z. This observation is supported by the following theorem proved by Eisenstein. 

4.3.2. Theorem. Let m = a + bi be an odd complex number, prime over Z[i]. 
Then the numbers A\, ... , A^ p _iy^ are divisible by m. 

PROOF. Dividing the polynomial A( p _ i )/ 4 + A ( p _ 5 )/ 4 <^ 4 + • * • + p v ~ l by the 
polynomial 1 + A\<p 4 +-(- A^ p _x) IaSP v ~ X we get 

(3.5) ip(ma ) = (p(c 0 + Cip 4 + c 2 p 8 H- ), 

where Cj £ Z[i\ and Co = A( p __iy 4 . 

Another expression for ip(ma) can be obtained if we use that 

(3.6) <p'(a) = \A ~p 4 {a) = 1 A^Sjp 43 (a), 

where Sj £ Q. Indeed, since y»(0) = 0 and p'iO) = 1, it follows that <p(a) = 
a(l + Epjai). The equation ip' 2 = 1 — i p 4 can only be fulfilled if Pj = 0 for j ^ 0 
(mod 4). Hence, <p(a) = cc(l + d\a 4 + d 2 a 8 + •■•), where dj £ Q. Replacing a with 
ma , we get 

(3.7) <p(ma ) = ma( 1 + d\{ma) 4 + d 2 (ma) 8 + •••). 

To compare (3.7) with (3.5), observe also that 

<p k = p k (a) — a k ( 1 4- d\a 4 + d 2 a^A - ) k . 

Therefore, (3.5) can be rewritten in the form 

(3.8) ip(ma) = cqo:( 1 + diQ: 4 + d 2 a 8 + ■••) + cia 5 (l + eia 4 + e 2 a 8 + •••)-!-. 
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Comparing (3.7) and (3.8), we get 


m = c 0 , 

dim 5 = codi + ci, 
d%m ® = Cod2 + Ciei + C 2 , 
cfam 13 = cods + C\e2 + C2/1 + C3, etc. 

It follows that Ck — mHk{m)l ~ l , where H k is a polynomial of degree Ak with 
integer coefficients all of which are relatively prime with an integer l. The formula 
nHk{n)l~ l = Ck(n) holds for any odd complex n G Z[i] (not necessarily prime); 
and Cfc(n) G Z[i]. Hence, nHk{n) = 0 (mod l). 

Let q G Z[i ] be a prime odd complex divisor of l. Then nHk{n) = 0 (mod q) 
for any odd complex number n. On the other hand, the congruence xHk(x ) = 0 
(mod q) cannot have more than deg(xHk(x)) — Ak+1 distinct (modulo q) solutions. 

For any odd complex q the residues after the division of odd complex numbers 
by q form a complete system of residues, because if n\ is even, n = rq +q is an odd 
complex number. The lattice of numbers proportional to q is generated by vectors 
q and iq. The area of the fundamental parallelogram of this lattice is equal to \q\ 2 ; 
hence, the complete system of residues modulo q contains \q\ 2 elements. Therefore, 
\q\ 2 < Ak + 1. Thus, if Ak + 1 < \m\ 2 , then l is not divisible by m. Since mH k {m ) 
is divisible by l, it follows that H k (m) is divisible by l and c k is divisible by m. 

Thus, if m is an odd prime, then ci, C 2 , ••• , C( p _ 5 )/ 4 , where p = \m\ 2 are 
divisible by m. The relation 

Ap-1)/4 + 5)/4V^ 4 d-i- 

= (1 + +-f A (p _ 1)/4 ^ p ~ 1 )(co + cup 4 + c 2 p 8 + • • •) 


implies 


A(p— 1)/4 — c 0 , 

A(p- 5)/4 — Co^4l + Cl, 


Ax = CoA(p_i)/4 + Ci^4(p_5)/4 3-1- C(p_5)/ 4 . 

Hence, the numbers Ai, A 2 , ... , A( p _i)/ 4 = m are divisible by m. □ 


Problems 

4.3.1. Prove that if a,b,x,y,€ M and x + iy — \/a + ib , then x and y are 
expressible in terms of a and b using only square roots. 

4.3.2. ( Eisenstein’s theorem) Assume that f(x) = aoX n + a iT n_1 + • • • + a n , 
where a 3 G Z[i], and for a number m prime over Z[i] the coefficients oq, ... , a Tl 
are divisible by m whereas a n is not divisible by m 2 and ao is not divisible by m. 
Then / is irreducible over Z[i]. 

4.3.3. Prove that if m is an odd complex number prime over Z[i], then the 

polynomial p p ~ 1 + A\ip p ~ 5 H-H A( p _ i)/ 4 from Theorem 4.3.1 is irreducible. 
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§4.4. Proof of Abel’s theorem on the division of the lemniscate 

In this section we give two proofs of Abel’s theorem. One of them — the classic 
one — belongs to Eisenstein, another — the modern one — to Rosen [C14]. The 
cornerstone of these two proofs, as well as of the original proof of Abel, is the 
fact that the lattice of periods of the function 9 ? (a) is invariant with respect to 
multiplication by the complex unit i. This manifests itself most transparently in 
Rosen’s proof. Therefore, we will start with it. 

In Rosen’s approach, instead of the lemniscatic function p(a) the Weierstrass 
function p(z) is used. It corresponds to the lattice A = {2au> + 2 biu \ a, b G Z}. 
Observe that this lattice is contained in the lattice of periods of p but does not 
coincide with it. At the end of this section we will show that for A we have £2 = —4 
and g 3 = 0 , i.e., 

p /2 (z ) = 4 p 3 (z) - ^p(z). 

The possibility to pass to the function p(z) is related to the following statement. 

4.4.1. Lemma. If a segment of length p(a) can be constructed with a ruler 
and compass, then a segment of length p(a) can also be constructed with a ruler 
and compass. 

Proof. Modulo A, the zeros of p are of the form 0, u> , ho, (1 + i)u and its 
poles are of the form ( 34 j^. The function p'(z) also has 

zeros lv, iu , (1 + i) w, whereas 0 is a pole of p'{z). Moreover, 


P 



= P 


/3 + 3 i \ 


and 



= P 


/ 1 + 3i 

V 2 



Consider the function 


9(z) = 


_ P\z) _ 

(p(z) - pi^uj^ipiz) p(^w)) 


In a neighborhood of zero we have p(z) = z~~ 2 -\ -and p'(z ) = —2 z~ 3 -i -; hence, 

g(0) = 0. It follows that g has the same zeros and poles as p. Hence, p(z) — Cg(z). 

Now we prove that it is possible to construct segments of length p (^), p(ffrcj) 
and p(^ca). First of all, observe that segments of length p(o>), p(iu>) and 
p(( 1 + i)u) can be constructed, since these numbers are the roots of the equation 
4.x 3 — lx = 0. Moreover, it is easy to verify that 


p(z±iz) 


i 4p 2 (z) - | 
T 8 p(z) 


Let p(a) be given. Let us consider x = and y = = f • To find p(f), it 

suffices to solve the quadratic equations 


p(a) = - 


i 4p 2 (x) - J 


and p(x) = 


! (y) -1 


8 p(x) ” 8 p(y) 

If it is possible to construct p(a), then it is also possible to construct p'{oi) = 




■) 3 (a) - Hence, it is possible to construct g(tj) and, since p{%) — 1, the 
constant C can also be constructed. As a result, we see that if it is possible to 
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construct p(a), then it is possible to construct g(a)- thus, it is possible to construct 
¥>(«)■ □ 


Therefore, to prove Abel’s theorem, we must verify that if n = 2>i • ■ ‘Pm, 
where the pi are distinct Fermat primes, then the segments of length p{—), where 
k — 1 , ... , n — 1 , can be constructed. 

The map z v —» (p(z),p'(z)) can be considered as a homeomorphism of the 
torus C/A to the curve E defined by the equation y 2 = 4x 3 — ~x. The addition of 
the points on the torus induces under this map an addition of points on E. The 
elements of the group E whose order divides n form a subgroup 


E n — 


( 2 auj + 2 biui ' 
p - 

, V n 


P 


' 2au + 2 biui \ 


n 


)) 


0 < a,b < n 


The zero element corresponds to a = b = 0; this is the infinite point. 

The group E n is analogous to the group C n for the circle (for the definition of 
C n see the beginning of §4.2). By extending the analogy, let us show that if (a, b) 
and (c,d) are points of E, then 


(a, b) + (c, d) = (/(a, b, c, d),g(a, b, c, d)), 

where af(u) = f{cru) and ag(u) — g(au) for any automorphism a of the field C. 
The addition theorem 

(4.1, 

shows that f(a,b,c,d ) = —a — c — for a ^ c. By differentiating equation 

(4.1) and taking into account that p"(z) = 6p 2 (z ) - \g 2 - 6p 2 (z) - | we can 
represent g as a rational function of a, b, c, d with rational coefficients. In the case 
Zi = z 2 (mod A) we can use the formula 

p(2,) = -2p W + i(kg) 2 . 

If zi = —z 2 (mod A), then formula (4.1) remains valid; one only has to consider 
the expressions in both parts as infinite ones. 

With the help of the functions / and g we can get functions f n and g n for 
which ( f n (x , y),g n (x, y)) — n- (x,y). The points of E n are given in this way by the 
equations f n (x,y) — oo and g n {x,y) = co. For the finite points this means that 
the denominators of the fractions f n and g n vanish. 

Now we can consider the field K n generated over Q by the coordinates of the 
finite points of E n . Repeating for E n the same arguments as for C n , we see that 
the group G n of automorphisms of K n over Q is isomorphic to a subgroup of the 
group Aut(I? n ). Since 

E n “ ^A/A “ A/nA “ Z/nZ © Z/nZ, 

it follows that Aut(i? n ) = GL 2 ^LInL). In the case of a prime n the order of 
GL 2 (Z/nZ) is equal to the number of bases of (Z/nZ) 2 , in other words, is equal to 
(n 2 - l)(n 2 - n). This number is divisible by n(n + 1) and, therefore, for n > 2 it 
cannot be a power of 2. Our arguments have reached a dead end! 
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We can only salvage it by a trick which we have repeatedly used in studying 
polynomials for the division of the lemniscate, namely, the invariance of the lattice 
of periods under multiplication by i. In our case this means that p(iz ) and p'(iz ) 
can be expressed in terms of p(z) and p'(z). Let us prove that p(iz) = —p(z) and 
p'(iz) = ip'(z). Indeed, 

P(z) = z~ 2 + J2'((z- A) -2 -A -2 ). 

A<EA 

The invariance of A with respect to multiplication by i implies that p(iz) = —p{z). 
By differentiating this equation we get ip'(iz) — —p'(z). Therefore, the action of i 
on the torus C/A induces the ^-action on E given by the formula i(x, y ) = (—a;, iy). 
On the group A/nA isomorphic to E n the action of k + il € Z[i\ is given by the 
formula 


(2 au> + 2biu>) (mod n) t—» (k + il)(2au> + 2bito) (mod n). 

This action can be translated to the group E n . 

Let F — Q(i), F n be the field generated by the coordinates of the points of 
E n over F, and G n be the group of automorphisms of F n over F. If a € G n , then 
a(i) = i ; hence, cr(i(x,y )) = (-ax, ivy) = ia(x,y). Furthermore, a ((a, b) + (c, d)) = 
a(a,b) + a(c,d). It follows that G n is a subgroup of the group of automorphisms 
of the Z[t]-module A/nA. 

The inverse to the map a + ib i—> (k + U)(a + ib), where a and b are taken 
modulo n, is the map 

k — il, ,. 

a + lbh ~^ k2^j2( a + tb )- 

This map is defined if and only if the number k 2 + l 2 is relatively prime to n. To 
obtain distinct maps, we have to assume that 0 < k, l < n - 1. Thus, the order 
of the group of automorphisms of the Z[i]-module A/nA is equal to the number of 
pairs (k, l), where 0 < k, l < n — 1 and k 2 + l 2 is relatively prime to n. Let $(n) 
be the total number of such pairs. We break the computation of $(n) into several 
lemmas. 

4.4.2. LEMMA. If p and q are relatively prime, then <&(pq) = 4 > (p)#(g). 

Proof. Let 0 < ai,fq < p — 1 and 0 < a 2,&2 < q — 1. Then the pairs 
(a, b) = (ai q + a^p, b\q + 6 2 p) constitute a complete system of the pairs of residues 
modulo pq. Moreover, a 2 + b 2 = (aj + b\)q 2 + (a\ + lff)p 2 . Therefore, 1 

(a 2 + b 2 ,pq) = 1 4=^ {(a 2 + b\,p) — 1 and (a 2 + b\, q) = 1}. □ 

4.4.3. LEMMA. If p is a prime of the form 4k + 3, then $(p) = p 2 — 1. 

PROOF. We must prove that if o 2 + b 2 is divisible by p, then both numbers 
a " and b 2 are divisible by p. Suppose that o 2 + b 2 is divisible by p but at least 
one of the numbers a and b is not divisible by p. Then both numbers a and b 
are not divisible by p\ hence, by Fermat’s small theorem, a p ~ l = 1 (mod p) and 
b p 1 ~ 1 (mod p), consequently, a p_1 + b p ~ l = 2 (mod p). On the other hand, 


1 For brevity we denote the greatest common divisor GCDfai,..., a n ) of oi, ... , a n by 
(&!)•••, a n ). In the next formula n = 2. 
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a v 1 -f b p 1 = a 4k+2 + b 4k+2 = (o 2 ) 2fc+1 + (b 2 ) 2k+1 is divisible by a 2 + b 2 ; hence, it 
is divisible by p. □ 


4.4.4. Lemma. If p is a prime of the form 4 k + 1, then <3?(p) = (p — l) 2 . 

Proof. First of all, let us prove that any prime p of the form 4fc + 1 can be 
represented as the sum of two squares. The simplest known proof of this statement 
was suggested by D. Zagier [C19]. 

Consider the set of all solutions of the equation x 2 +4yz — p in natural numbers. 
It suffices to prove that this equation has a solution for which y = z. In other words, 
the involution a(x,y,z ) = ( x,z,y ) defined on the set of solutions has a fixed point. 
(Recall that an involution is a map / such that f(f{x)) = x for all x\ if fix o) = xo, 
then xq is called a fixed point.) On a set consisting of an odd number of elements any 
involution has a fixed point. Therefore, it suffices to prove that the total number of 
solutions of the given equation is odd. To this end, it suffices to construct another 
involution r of this set that has exactly one fixed point. Namely, we define: 


( (x + 2z, z, y - x - z) 

for 

x <y- z, 

(A) 

t(x, y,z) = < [2y - x, y, x - y + z) 

for 

y - z <x <2y, 

(B) 

[(x ~2y,x - y + z,y) 

for 

2y < x. 

(C) 

It is easy to verify that x ^ 2y and x y 

- z\ 

besides, any solution is indeed 


transformed by t into a solution. 

Let us divide the solutions into three types (A)-(C) according to which of the 
following three inequalities is satisfied: 


x < y — z, y — z < x < 2y, 2y < x. 

The map r sends solutions of type (A) into solutions of type (C), solutions of type 
(B) into solutions of type (B), and solutions of type (C) into solutions of type (A). 
Now it is easy to verify that r is an involution. Only a point of type (B) can be 
fixed. The equation ( x ,y, z) — (2 y — x,y,x — y + z) implies that y = x. Therefore, 
p = x(x + 4 z), i.e., x = y — 1 (here we have used the fact that p is prime). Thus, 
there is exactly one fixed point; namely, the point (1,1, fc) (here we make use of the 
fact that p is of the form Ak + 1). 

Now we prove that for a fixed a 0 the equation x 2 + a 2 = 0 (mod p) has 
precisely two solutions. Indeed, there exist nonzero numbers b and c such that 
b 2 + c 2 = 0 (mod p). Multiplying this inequality by (ac -1 ) 2 we see that b 2 + a 2 = 0 
(mod p), where b\ = abc~ 1 . Therefore, x 2 = b\ (mod p); the solutions of this 
equation are x = ztby. Thus, only 2{p — 1) pairs with nonzero a and b and the pair 
(0, 0) do not enter 4>(y). It follows that 

$(P) =P 2 ~ 1 ~2(p— 1) = (p—l) 2 - □ 


It is also obvious that <3>(2) = 2. 

4.4.5. LEMMA. Let p be a prime, k> 1. Then 4>(p fe ) — (p fe “ 1 ) 2 4>(p). 

PROOF. The numbers a+aip, where 0 < a < p —1 and 0 < cq < p k ~ 4 — 1, form 
a complete system of residues modulo p k . The number (a + aip) 2 + (b + byp) 2 is not 
relatively prime to p k if and only if it is not relatively prime to p, i.e., a 2 + b 2 = 0 
(mod p). It remains to observe that to every pair (a, b) that enters A>(p) there 
corresponds (p fc-1 ) 2 pairs that enter 4> (p k ). 
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Now it is easy to verify that $(n) is a power of 2 if and only if n — 2 a p\ • • •p m , 
where the pi are distinct Fermat primes. Indeed, <f>(2 a p l { 1 ■ ■ ■ ) can be a power 

of 2 only if ki = • • • = k m ~ 1. If p — 4fc + 1, then F(p) = (p - l) 2 . This number is 
a power of 2 only if p = 1 + 2 C . Let p — Ak + 3 and $(p) = p 2 — 1 = (p — 1 )(p + 1) 
be a power of 2. The consecutive even numbers p — 1 and p + 1 can be powers of 2 
only if p = 3. □ 


To complete the proof of Abel’s theorem it remains to verify that for the lattice 
considered, i.e., A = {2au> + 2biu\a,b 6 Z}, we have 

g 2 = 60V' (2 au + 2biu)~~ 4 = ^ and gs = 140 (2ow + 2biu)~~ 6 = 0. 

The second equality is obvious since #3 (A) = —gs(iA) and in our case iA — A. The 
main difficulty is to prove that J2'( au} + biu)~ 4 — jg. 

Consider the three lattices 


Lq — {au + biu}, 

f au + biu 
L i " \ -A- 

l2~- 


au + biu 


a and b are odd j-, 
a — b is oddl. 


Then \L q = Lq U L 1 U L 2 and L 2 = ^L x . Define \L\ = £| 6I T 4 . Then 


16|L 0 | 


;Lo 


\Lq\ + \Li\ + \L 2 \ and \L 2 \ — ^ |Li| — ~4|Li|; 


hence, |Lij = — 5|Lo|- To prove the desired equality |Lo| = jg let us derive one 
more relation between \Li\ and \L 0 \. For this we use the fact that L 0 is the lattice 
of zeros of <p{z) and L\ is the lattice of its poles. Taking into account that ^(0) = 1 
we get 


<p{z) 


n'O 


cegLo 


n 

/3eLi 


z 

p t 


-1 


where the infinite products should be understood as limits of finite products over 
|«|, \fi\ < N as N —> 00. The nonzero elements of the lattices Lq and Li can be 
divided into 4-tuples of the form {±7, iiy}; hence, 



where 0 < arg o, arg (3 < Therefore, 


(4.2) Z ^Jz) ~ Z ~(f = 1 + (l-^il - I To |)* 4 4-• 

The function z~ l ip(z) does not vary under the change of z to —c or to ±iz\ hence, 
( p(z ) = z( 1 + cz 4 + •••). Moreover, (<p'(z)) 2 = 1 — p 4 (z). Hence, 

(1 + 5cz 4 + • • • ) 2 = 1 - z 4 {l + cz 4 + • ■ • ) 4 
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and, therefore, c= It follows that 


(4.3) 


ip{z) 


1 + 4 cz + 


+ 


ML 0 


Comparing (4.2) with (4.3) we deduce that jXi | - |L 0 | = - f. Since |Lj| : 
it follows that |L 0 | = ~r. □ 

Now let us briefly reproduce Eisenstein’s proof, more exactly, a modified ver¬ 
sion suggested by Melnikov (see [CIO]). This proof is based on the study of the 
properties of the polynomials associated with the division of the lemniscate. 

Thus, consider the numbers <p(~) for k = 1,... ,n - 1, where n is a natural 
number. By the addition theorem, it suffices to confine ourselves to the case when 
n is a prime. For n — 2 the theorem was proved in §4.3, where we found an explicit 
form of the division equation and solved it in quadratic radicals. Therefore, suppose 
that n is an odd prime. If n = 3 (mod 4), then n remains prime in the ring Z[i\ 
whereas if n = 1 (mod 4), then n factors in Z [i] in the product of two prime 
conjugate factors m — a + bi and m! — a -bi. 

After calculating ip{~) and we can find y?(~) using the addition the¬ 

orem. Thus, suppose that n is a prime in Z[i], and consider the equation of the 
division of the lemniscate 


(4.4) 


$(a;) — x p 1 + Axx 9 5 + A 2 x p 9 4-f- A 


4 


0. 


By Theorem 4.3.2, for a prime Gauss number n all the coefficients Ai are divisible 
by n and, moreover, A P ~i = n. Applying Eisenstein’s criterion we see that §(x) is 
irreducible for n prime. Let g be a primitive root modulo the prime n. Clearly, g 
can be selected odd. 

Denote the roots of equation (4.4) by 



j * * * j Xp~. 2 — (p I g 


,P-2 


XT 


n 


We see that each of the roots is the same rational function of the previous root, 
i.e., 


xi+1 = xi 


^gixiY 


l = !»•••, P-3. 


This means that the equation of the division of the lemniscate into n parts is 
abelian, hence solvable in radicals (for an exhaustive theory of abelian equations 
see the two-volume treatise by Burnside and Panton [B5]). All these radicals are 
quadratic if and only if p = 2 a + 1. If n is a prime Gaussian number, then p is also 
a prime and is of the form p — 2 2/3 + 1. If p is the square of a prime q such that 
Q = 3 (mod 4), then the identity q 2 — 2 a + 1 is only possible for q — 3. 

§4.5. Several remarks on Serret’s curves 

Wenn die Konige baun, haben die Karrner zu tun’ (When kings are building, 
says a German poet, carters have work to do. — A line from xenia “Kant and his 
followers” ascribed to Schiller. — But carters need roads. Not seldom, in the 
history of our science, has it happened that the king opened up a new road into 
the promised land and that his successors, intent upon their own paths, allowed it 
to be overrun by brambles and become unfit for transit.” 
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This maxim of Andre Weil (cited from [B25]) is quite applicable to Serret’s 
curves. It suffices to say that even in the famous Lectures on the History of Math¬ 
ematics in XIX Century by Felix Klein we encounter Serret’s name only once and 
in a quite different context. 

The only mention of these curves that we managed to fish out in the mathe¬ 
matical literature is due to Salmon and is from Victorian times. 

About two years ago Serret’s curves drew the attention of the Yugoslavian 
mathematician A. Lipkowsky. Below we reproduce some of his results. We need 
certain properties of plane algebraic curves. We list these properties only briefly, 

referring for more details to [B23, B4]. 

Choose a projective coordinate system in the complex projective plane. Con¬ 
sider an irreducible homogeneous polynomial f{x,y,z) of degree n in variables 
x,y,z with complex coefficients. If one triple of coordinates (ai,a 2 ,a 3 ) of a point 
P satisfies the equation f(x , y, z) = 0, then any other triple of coordinates of P also 
satisfies this equation since /(rai,ra 2 ,ra 3 ) = r n /(ai, a 2 , a 3 ). The set of all points 
P with this property is called an irreducible algebraic curve of degree n. 

Let C be a curve given by the equation f(x, y, z) = 0. The polynomial f(x, y, z), 
being irreducible, is not divisible by z, so it is uniquely determined by the corre¬ 
sponding nonhomogeneous polynomial f{x,y, 1). Denote it by F(x,y). The equa¬ 
tion F(x,y) = 0 is called the equation of the curve in the corresponding affine 
coordinate system. It is clear that the solutions of the equation F{x,y) — 0 corre¬ 
spond to those points of the curve C that do not lie on the infinite line z = 0. 

Let A = (oi,o 2 ,a 3 ) and B = (b u b 2 ,b 3 ) be two distinct points of the curve 
C. A point P = (®i,*2,a; 3 ) lies on the line L joining A and B if and only if 
x k = sa k +tb k , k — 1,2,3, for some s and t. The values of s and t such that the 
corresponding point lies on the curve C are the solutions of the equation 

f(sa + tb ) = f{sa\ + tbi,sa 2 + tb 2 ,sa 3 + tbs ) = 0. 

Since / is an irreducible polynomial, f(sa + tb ) does not vanish identically in s and 
t. Therefore, it is a homogeneous polynomial of degree n in these variables, and the 
equation f(sa + tb) — 0 is satisfied for exactly n values of the ratio s : t provided 
we take into account the multiplicities. Each such value of s : t determines an 
intersection point of the line L and the curve C. It is convenient to regard a point 
corresponding to a root of multiplicity r as a multiplicity r intersection point of L 
and C. 

Now let us analyze in more detail the intersection of the curve C with a straight 
line L passing through a given point P of C. Let fix, y) — 0 be the equation of 
C in an affine coordinate system in which P has the coordinates (a, b). Then L is 
given by the equations 


x — a T At, 
y = b + pt, 

and any line L passing through P is completely determined by the ratio A : y. The 
intersection points of C and L correspond to the roots of the equation 


/(a + At, b + yd) = 0. 
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Expanding the right-hand side of this equation in powers of t and taking into 
account that /(a, b) = 0 we obtain 

£2 

(fx A + fyfl)t + ( fxx + 2/a;yA/i + fyyfl 2 ) ~ H-=0, 

where f x , f y , etc., are the values of the partial derivatives of / at P. Two cases are 
possible. 

(a) First, let us assume that f x and f y do not vanish simultaneously. Then 
almost all lines passing through P intersect C at P with multiplicity 1. The only 
exception is the line corresponding to the ratio A : p such that 


/* A + f y p = 0. 


This line is the tangent line to C at P. 

(b) Now let us assume that all partial derivatives up to order r — 1 vanish at P 
and at least one derivative of order g does not vanish at P. Then each line passing 
through P intersects C with multiplicity at least r, and exactly r lines have the 
intersection of multiplicity higher than r. These exceptional curves are tangent to 
C at P and correspond to the solutions of the equation 

A r + (^Jfx,...,x,y>f V-+-1- fy,...,yP r ^j —0. 


Note that each exceptional line should be counted as many times as the multiplicity 
of the corresponding root. In case (b) P is called a multiplicity r point of the curve 
C. 

A multiplicity one point of the curve G is called a simple point. Points of 
multiplicity two and more are called singular points. A point of multiplicity r is 
called an ordinary point of multiplicity r if at this point the curve admits r distinct 
tangent lines. A necessary and sufficient condition that (a, b, c) is a singular point 
is given by the equations 

a u \ 9f(a,b,c) df(a, b, c) df(a,b,c) n 

C) = —= —g^- = 0 


in projective coordinates or by the equations 


F(a,b) 


dF(a, b ) 
dx 


dF(a,b ) 
dy 


= 0 


in affine coordinates. 

The singularity criterion can be expressed in projective coordinates as follows. 
A point P of the curve C is a point of multiplicity r if and only if all derivatives of 
/ of order r — 1 vanish at P and there exists a derivative of order r that does not 
vanish at P. 

Now let us consider two projective planes: Pl\ with coordinates (xi,X 2 , £ 3 ) and 
PI 2 with coordinates ( 2 / 1 , 3 / 2 , 2 / 3 )- Define a mapping T : Pl\ > PI 2 by the formula 


Vi — Xj^ki 

where ( i,j,k ) = (1,2,3) and all three indices i,j,k are distinct. The mapping T 
is called the quadratic transformation or blow-up of the plane Pl\ to Pp- This 
mapping has the following properties. 
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(1) Each point of Pl\ with the exception of the points (1,0,0), (0,1,0), and 
(0, 0 , 1 ) is mapped to a certain point of Pl%- The exceptional points are called the 
fundamental points of T; the images of these points are not defined. 

( 2 ) All points of the line X{ = 1, i = 1,2,3, except the fundamental points are 
mapped to the point y-i — 1, y 3 — yk = 0. Three lines xg = 0 are called irregular 
lines of T. 

Denote by t' the mapping 

Xi = y 3 yk 

of PI 2 to Pl\. Clearly, it also possesses properties (1) and (2). Furthermore, we . 
have the following property. 

(3) If x = (x' i, X‘ 2 , x 3 ) is a point that does not lie on an irregular line of the 
mapping T, then the point y — T(x) does not lie on an irregular line of the mapping 
T 1 , and T'(y ) = x. 

Since T' has a similar property, we can say that T and T' define a one-to-one 
correspondence between points of the planes Pl\ and PI 2 that does not lie on the 
corresponding irregular lines. 

If /( Xi,X 2 , xs) — 0 is the equation of a curve on the plane T, then the images 
of the points of this curve under T satisfy the equation 

g{y) = /(stejfe, 2/12/3, Shite) = 0. 

The curve g = 0 is called the algebraic image of the curve / — 0 under the mapping 
T. Let / be an irreducible polynomial and let g be the algebraic image of /. If 
g{y) = p(y)f{y), where p(y) is the product of certain powers of yi and f is not 
divisible by any yi, then the curve f — 0 is called the image of the curve / = 0 
under the mapping T. 

The following fundamental theorem holds: any irreducible curve can be made an 
irreducible curve with only ordinary singularities by applying consecutive quadratic 
transformations. 

To analyze singular points of a curve it is convenient to parametrize parts of 
this curve by formal power series. Denote by C[[t]] the field of formal power series in 
variable t over complex numbers. Each nonzero element u E C[[t]] can be uniquely 
written in the form 

u = (ao + ait + • • • )/t lc , 

where k is an integer and oq 7 ^ 0. The number k is called the order of the series u 
and is denoted by 0(u). 

Let f(xi,X 2 , xs) — 0 be the equation of an irreducible curve C in the complex 
projective plane. Elements u\,U 2 ,u 3 E C[[t]] define a parameterization of the curve 
C if 

(1) f(ui,u 2 ,u 3 ) = 0; 

(2) there does not exist a nonzero element e E C[[t]] such that eu\ E C for 
« = 1,2,3. 

Let u — (u\,U 2 ,u 3 ) be a parameterization of the curve C and h — — min O(ui ). 
The elements {vi,V 2 ,v 3 ) defined by the formula Vi — t h Ui determine the same 
parameterization of C. Moreover, Vi E C[[ij] and at least one of the numbers 
Uj(0) = ai does not vanish. In this case the point a — (ai, < 22 , <23) is called the center 
of the parameterization. It is clear that under a transformation of coordinates the 
coordinates of the center transform in the same way as the coordinates of points. 
Therefore, the center of the parameterization is defined uniquely. 
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If u = (ui,U 2 ,Us) is a parameterization and s e C[[£]j is such that 0(s ) > 0, 
then v ~ {ui(s), U 2 (s), us(s)) is also a parameterization with the same center. If 
0(s ) = 1, the two parameterizations are called equivalent. 

Let us assume that w* € C[[C]] for some r > 1. Then we can use power series in 
the new variable T' = t r . In this case the parameterization u — (u\ , U 2 , M3) (and any 
equivalent parameterization) is called reducible. Otherwise the parameterization is 
called irreducible. An equivalence class of irreducible parameterizations of the curve 
C is called a branch of C. The common center of these parameterizations is called 
the center of the branch. 

If u — (u\,U 2 ,Us) is a parameterization of a branch of P such that 0(tq) > 0 for 
all z and 0{ui) — 0 for at least one i, and if g{x 1, £2, £3) is an arbitrary irreducible 
polynomial, then we can define the order Op (g) of g as the order of the power series 
g(uit U 2 > W3). The positive number r — min Op(L), the minimum being taken over 
equations of all lines L passing through the center of the branch P , is called the 
order of the branch. 

One of the most important applications of the notion of a branch and of the 
order is the following statement. 

STATEMENT. (1) If Q is a multiplicity r point of the curve C, then the sum of 
orders of all branches of C with the center Q equals r. 

(2) A point of C is simple if and only if it is the center of just one branch. 

Now we continue the study of Serret’s curves. 

Recall once again the definition of Serret’s curves. Let p be a fixed rational 
number. Consider the triangle OPM with the vertex O at the origin, the lengths 
of the sides OP and PM equal to y/p and \Jp + 1, respectively, and the angles at 
the vertices O and M equal to a and fl (see Figure 29). 

Let us vary the triangle OPM so that the point O is fixed and the lengths of 
the sides OP and PM are fixed, whereas the angle to between the x-axis and OM 
is determined from the equation 

(5.1) coscj = cos (pa — (p + l)/3). 


Then the locus of vertices M is called Serret’s curve corresponding to the parameter 
p. We denote this curve by S p . 

Let p be the length of the segment OM. Then the coordinates of M are 
x — p cos <v, y — psintn. The law of cosines implies that 


(5.2) 


cos a = 


p 2 - 1 
2 Py/P ’ 


cos (3 — 


p 2 + 1 
2 py/p + 1 ’ 


Therefore, for p rational, the expression (5.1) can be represented as a polynomial 
in cos a, cos/3, sin a, and sin/3. In other words, there is a polynomial dependence 
between the variables x,p and y,p, i.e., a system of polynomial equations 


(5.3) P(x,p) - 0, Q(y,p) = 0. 

Eliminating p from these equations we get the following polynomial equation 
for S p : 


(5.4) 


Fix, Vi = 0. 
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The following are several examples of equations (5.3) for different values of 

-1 + 4 p 2 + p 4 


(5.5) p = 1, 

(5.6) p = 2, a; 

(5.7) p — 3, x = 


£ 


4p 2 

1 - 12p 2 + 27p 4 + 4 p 6 


z 2 +i/ 2 =p 2 ; 


4 • 3\/3p 4 
1 + 24p 2 - 162 p 4 + 256p 6 + 27p 8 
96 • \/3p 6 


x 2 +y 2 =^p 2 \ 


x 2 + y 2 = p 2 \ 


(5.8) p = 1/2, + 

(5.9) p ~ 1/3, »= V3(- 9 + 2V-2p-+3p«) i ^ + ^ = 

The simplest Serret’s curves are plotted in Figures 34-36. 





Figure 36. Curve S 3 
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Let p — | (s, t £ N) be the representation of a positive rational number p as 
the fraction reduced to its lowest terms. It is possible to show that relations (5.3) 
are of the form 


(5.10) 


X = 


ao + flip 2 + • • • + a n p 2n 
bp m 


2,2 2 
x +y = p , 


where n = s +1 and m — 2s + {t — 1). Besides, a 0 — (— t) s and a n — s s . For odd 
denominators t the form of the polynomial F(x,y) in equation (5.4) is, therefore, 
as follows: 

(5.11) F(x, y) - bx(x 2 + y 2 ) s +(t-D / 2 - « 0 - a x (x 2 + y 2 ) - a s+t (x 2 + y 2 ) s+t . 


In this case the degree of the curve is equal to a = 2n — 2s + 2 1 . 

For t even the form of the polynomial F(x, y) is more complicated: 

(5.12) F(x,y) = b 2 x 2 (x 2 +y 2 ) 2s+ ^ t ~ 1) ~[a Q ~ai{x 2 +y 2 )+-- - + a s+t {x 2 +y 2 ) s+t } 2 . 


Its degree is equal to 4s + 4i. 

Here we will only consider Serret’s curve S p with integer values of p, i.e., the 
case of t = 1 . 

Recall (see §3.3) that for the length of an arc of Serret’s curve S p one has 



where k 


p+i' 


On the other hand, changing the notation, one has 

dx 


dl — (coefficient)- , 
V 


where 


y 2 — x 4 — 2 ( 2 p + l)x 2 
Since the right-hand side can be factored, 


y 2 = (x - a)(x + a) ^ + 


where a = y/p ~+T + ^/p, we can apply some birational transformation, first to 
lower the degree from four to three (the transformation x — a — y — ^) and 
then to normalize the roots of the third degree polynomial on the right-hand side 
(a translation to have 0 as one root and a rescaling to make 1 another root). All 
transformations are combined together in the explicit formula 

x^a—, y >-> 2 t(a + 1 )^ 55 , 

where A = and we obtain the elliptic curve 

y 2 — x(x — l){x — A 2 ). 

This curve is called the elliptic curve, associated with Serret’s curve S p . It is 
clear from the previous consideration that this curve is uniquely determined up to 
standard automorphisms of root. 
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It is well known that the lemniscate is a rational curve, i.e., that the lemniscate 
admits a rational parameterization. It turns out that all Serret’s curves S p have 
this property for positive integer values of p. 

4.5.1. Theorem. The curve S p is rational for any positive integer p. 

PROOF. This proof is based on the following statement which, in turn, follows 
from Hurwitz’s theorem [B8]: 

Let the plane algebraic curve be determined by the equation F(x,y ) = 0. Let 
d — deg F and let v be the multiplicity of a singular point. Set 

2 ^ 2 ~ 

where the summation is over all singular points, infinite ones included. The curve 
F(x,y ) = 0 is rational if and only if g = 0. 

The number g is called the genus of the curve. 

Thus, consider the polynomial (5.11), 

F(x,y) = bx(x 2 + y 2 ) p - a 0 - o,i(x 2 +y 2 ) - a p+ i(x 2 + y 2 ) p+1 . 

First of all, let us study its behavior at infinity. For this, split it into homogeneous 
summands 


1) 



(5.13) F(x,y) =F 0 (x,y) + --- + F d (x,y), d = 2p + 2, 
and pass to the homogeneous polynomial of x,y,z: 

(5.14) f(x, y , z) = z d F 0 (x, y) + ■ ■ ■ + zF d -i(x t y) + F d (x, y). 


The system of equations for the singular points of f(x,y , z) is as follows: 

dx dy dz 


(5.15) 


= / = 0 . 


Therefore, the singular points on the infinite line z — 0 are of the form (x : y : 0), 
where ( x , y) is a solution of the equations 


(5.16) 

Here 


dF d (x, y) 
dx 


dF d (x,y) 

dy 


- F d -i{x,y) = 0. 


F d (x,y) = -a p+1 (x 2 + y 2 ) p+1 and F d _ x (x,y) = bx(x 2 + y 2 ) p . 


Therefore, equations (5.16) can be reduced to the system 

x(x 2 + y 2 ) p = y(x 2 + y 2 ) p = 0, 

which yields exactly two points: (±i; 1; 0). 

Let us study the behavior of the curve at these singular points. Consider the 
homogeneous polynomial (5.14) in the chart y = 1 and translate the origin to the 
point (x, z) = (±i, 0) (for the details concerning the technique used below see, for 
example, [B23] and [B4]). Since in both cases the calculations are similar, let us 
carry them through only for (i;l;0), i.e., for (x,z) — (i, 0). In this case after the 
transformations indicated we get the polynomial 


(5.17) b(x + i)x p (x + 2 i) p z — aoZ 2p+2 — a\x{x + 2 i)z 2p - a p+ \X p+1 (x + 2 i) p+1 . 
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The initial form of this polynomial at the point (0; 0), i.e., the sum of monomials 
of minimal degree, is 


(5.18) 


b'x p z — a' n z 2pJr2 


a^xz^ 


a' p x p z - a' p+l x> , 


r .p+i 


where the coefficients b', a' 0 ,... , a' p+1 are determined from the coefficients b, o,q, ... , 
a p + 1 . In this way we get the singular points of multiplicity p +1 with characteristic 
monomials x p z, z 2p+2 , x p+1 . The Newton diagram corresponding to these singular 
points is of the form plotted in Figure 37. 



Figure 37 


Clearly, on the sides of this diagram there are no points with integer coordi¬ 
nates. This means that the singularity has two branches. To compute how the 
genus of the singularity diminishes, we can resolve the singularity by means of 
blowing up. In this case just two blowups suffice. The first blowup is of the form 
x i—> xz , z i * z. This implies that 

x p z + z 2p+2 + x p+1 z p+1 (x p + z p+1 + x p+1 ). 

In the chart x t—> x, z t—»• zx there are no singularities, because the form 
x p z + z 2p+2 + x p+1 turns into x p+1 (z + x p z 2p + 1 ). Thus, as a result of the first 
blowup we get one singular point of the form x p + z p+1 + x p+1 : its multiplicity is 
equal to p. The second blowup yields simple points. Indeed, for x xz, z z we 
have 

x p + z p+l + x p+1 i-> z p (x p + z + x p+1 z). 

If x x, z i—> xz, then 

x p + z p+1 + x p + l i—> x p {l + xz p+l + x). 

The tree of infinitesimally close points is of the form 

(P+ 1) (P) (1), 


and, therefore, the contribution of the two infinite points (±i : 1 : 0 ) to the genus 
g is equal to 


~ (P+ l )P + P(P ~ !) 



-2 


2 


2 
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Therefore, when these singularities are resolved the genus g becomes equal to 

(5.19) 9=1-L- - ± - r - L ~2p‘=p-^2 

where Y' denotes the summation over all singular points lying in the finite part 
of CP 2 . In particular, the inequality (5.19) implies that in the finite part the 
contribution of Y' from the singular points of S p is not greater than p. 

Now let us find the finite singular points of S p . Making use of the symmetry 
of the curve, we may suppose that these singular points lie on the 2 >axis and, 
therefore, satisfy the system of equations 


(5.20) 


dF(x, 0) 
dx 


= F(x, 0) = 0. 


This assumption becomes justified if we can justify that the contribution Y' 
of these singular points is exactly equal to p. 

In order to analyze the system (5.20), it is convenient to make use of the relation 
(5.1). The equation F(x, 0) — 0 is equivalent to the fact that = 0 or tv which, in 
turn, is equivalent to the equation 

(5.21) cos (pa — (p + 1)P) — ±1 


which implies that 


(5.22) 


pa — (p + 1)(3 = qir, g € Z. 


Adding the constraints 
(5.23) sin (3— 

to (5.22) we get the system of equations: 


P 


p+ 1 


sin a and 0 < a, (3, a + (3 < tt 


(5.24) 


0 


-a 


p+i p+i 

/3 = arcsin 


7 r, q € Z, 


p+ i 


since). 


For q = 0 the straight line fi = Yt\ a intersects the curve j3 — arcsin (y sin a) 

at exactly two points. All the other straight lines /? = — ^+i 7r either do not 

intersect the curve at all (for q < 0 and for q > p) or intersect it at exactly one 
point (for q = 1,... ,p). The coordinates of the last intersection point are (n, 0). 
Therefore, our system (5.24) or, which is the same, the initial system (5.20) has 
exactly 2 simple and p double roots. This means that there are no other singular 
points and the genus of the curve is <7 = 0. Hence, all the curves S p for positive 
integer p are rational. □ 


We have mentioned above that the crucial moment in the proof of Abel’s the¬ 
orem on the lemniscate’s division is the invariance of the period lattice of the 
lemniscatic function <p{a) with respect to the multiplication by the complex unit 
i. This invariance is a manifestation of an important general property of certain 
elliptic curves: the presence of so-called complex multiplication. 

Let E be an elliptic curve; as a group manifold it is isomorphic to the quotient 
of € modulo the lattice A spanned by the periods The curve E is also 



§4.5. SEVERAL REMARKS ON SERRET’S CURVES 


101 


isomorphic to the quotient of C modulo the lattice spanned by zui, ZU 2 for any 
z E C and, therefore, we may assume that = 1, o >2 = r and Imr > 0 . Any en¬ 
domorphism of E can be lifted to an automorphism of C and, therefore, it induces 
a multiplication by a complex number z such that zr E A. The endomorphisms 
of E constitute the ring A(E ) containing the subring of the so-called trivial auto¬ 
morphisms, which is a subring isomorphic to Z. The remaining endomorphisms, if 
any, are determined by complex numbers and are called complex multiplications. If 
A(E) 7 ^ Z, we say that E possesses complex multiplication. A generic elliptic curve 
E has no complex multiplications. 

Indeed, suppose that z determines a nontrivial endomorphism of E. Then 
z = a + br, zr — c + dr (a, b , c, d are integers, 6 ^ 0 ) 
and, therefore, 

(5.25) ar + br 2 = c + dr. 


Hence, r must belong to the imaginary quadratic field K and z must belong to the 
ring of integers O(K) of K , because z determines an endomorphism of a Z-module 
of finite rank. Therefore, A(E) is a subring of 0(K ) containing Z and has rank 2 
when considered as the Z-module. Conversely, any such subring R in the imaginary 
quadratic field K can be obtained in this way: it suffices to set E = C/R. 

The so-called -invariant is the most convenient way to express that an elliptic 
curve E has complex multiplication. If E is determined by an equation in the 
Weierstrass form y 2 = x 3 + ax + b, then its j-invariant is given by the formula 

4 a 3 

< 5 ' 26 > ™ 

Two curves are isomorphic if and only if their j-invariants are equal. If E is 
defined over Q, i.e., if a, b E Q, there is a very simple criterion for the existence of 
complex multiplication: E possesses complex multiplication if and only if j(E) is 
an integer ([B12], [B20]). It is possible to show ([B20]) that there are exactly 13 
classes of elliptic curves with complex multiplication and rational j-invariant. The 
values of j-invariant for them are 

2 6 • 3 3 , 2 6 • 5 3 , 0, —3 3 • 5 3 , —2 15 , 

—2 15 3 3 , —2 18 • 3 3 • 5 3 , -2 15 • 3 3 ■ 5 3 • ll 3 , 

_ 2 18 . 3 3 . s 3 .23 s . 29 s , 2 3 -3 3 -ll 3 , 2 4 -3 3 -5 3 , 

3 3 • 5 3 • 17 3 , —3 • 2 15 • 5 3 . 

4.5.2. Theorem. Among the elliptic curves associated with Serret’s curve 
S p only the elliptic curve corresponding to the lemniscate Si possesses complex 
multiplication. 


Proof. Let 


V 


x(x — l)(x — X 2 ), A 2 


1 

¥ 


p t 1 
p 


be the elliptic curve associated with Serret’s curve S p . Its j-invariant is equal to 

(A 4 - A 2 + l ) 3 _ o 8 (p 2 +P+l ) 3 


(5.27) 


28 A 4 (A 2 — l ) 2 ^ p 2 (p+ l ) 2 
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Since 


J — 2 


8 {p(p+l) + 1)' 


2 8 p(p+l)+3-2 8 + ^ I y + ^ 5 ^ 


3 • 2 i 


2 8 


(p(p + l)) 2 

it follows that j can only be integer for p — 1. In this case j = 2 6 • 3 3 . Thus, only 
the elliptic curve corresponding to the lemniscate S\ admits complex multi¬ 
plication. ^ 


In conclusion let us formulate several problems. 

First of all, it would be very useful to find rational parameterizations for Serret’s 
curve S p with positive integer p similar to the lemniscate’s parameterization. 

The main problem, however, is the study of the structure of Serret’s curves for 
noninteger values of p. Here we have two very different families: with odd and with 
even values of the denominator t. It seems very plausible that for odd t all Serret’s 
curves are rational. At the same time the computer experiments demonstrate that 
for even t irrational curves can occur. 

Further on, it is very important to compute the ^-invariants of all the corre¬ 
sponding elliptic curves and study whether or not they admit complex multiplica¬ 
tion. 

Finally, it would be very interesting to check whether Eisenstein’s proof can be 
generalized to curves without complex multiplication. 



CHAPTER 5 


Arithmetic of Cubic Curves 


In this chapter we consider certain well-known diophantine equations. Recall 
that a diophantine equation is a polynomial equation 

f(x = 0 

whose coefficients are integers. If this equation has a solution {xj, x 2 ,..., x n } in 
integers, we will say that {xi,x 2 , ■ ■ ■ ,x n } is an integer solution. A solution of this 
equation in rational numbers is called a rational solution. It is clear that if / is a 
homogeneous polynomial, then the problem of the search for rational solutions is 
equivalent to the problem of the search for integer solutions. 

The sources of the theory of diophantine equations can be traced to the math¬ 
ematics of antiquity. Euclid developed a method for finding the greatest common 
divisor d of two numbers aq and a 2 . This method leads in general to solving the 
diophantine equation 1 

a\X\ + a 2 x 2 = d. 

If a number b is divisible by d, we can multiply xq and x 2 by b/d, getting a solution 
of the equation a\x\ + a 2 x 2 — b. Euclid also had a method for finding the greatest 
common divisor (ai,a 2 ,a 3 ) of three numbers; this method fits n numbers as well. 
It is based on the fact that 

((cti,..., a n — i), cin) ~ (cq ,..., fln)* 

The ancient Greek mathematicians never made use of equations when they 
solved geometric problems. The notion of quantities as geometric objects that they 
developed did not allow them to do this. Consequently, the mathematicians of 
classical Greece were not very interested in equations. (It was Descartes who 
started to use equations systematically for the solution of geometric problems.) 

After Euclid’s time, no new methods for the solution of equations in integers 
were developed until the third century by Diophantus, an Alexandrian mathe¬ 
matician. He developed methods for solutions in integers and rational numbers 
of quadratic and certain cubic equations with two and more unknowns and thus 
constructed a foundation for a new mathematical discipline now called the theory 
of diophantine equations in his honor. He summarized his studies in a vast treatise 
in 13 books under the common title Arithmetics. 

The fate of this treatise is remarkable. Soon after it was written it disappeared 
for more than a millennium and was considered to be lost forever. It was only 
in 1464 that a German scientist Regiomontanus accidentally found 6 of the 13 
books of Arithmetics. It was first published in a Latin translation in 1575. After the 

1 Euclid himself did not, however, deal with solutions of such equations. 
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French edition, prepared by Bachet de Mesiriac, was published in 1621 it became 
a desk-top book for many mathematicians, Pierre Fermat and Rene Descartes 
among them. It was on the margins of his copy of Diophantus’s Arithmetics that 
Fermat wrote one of the most notorious scholia in the history of mathematics: 

Cubum autem in duos cubos, aut quadrato-quadratum in duos 
quadrato-quadratos, et generaliter nullam in infinitum 
ultra quadratum potestatem in duas ejusdem nominis fas est 
divedere; cujus rei demonstrationem mirabilem sane detexi. 

Hanc marginis exiguitas non caperet. 

“It is impossible to expand a cube into two cubes or a bisquare into two bisquares 
or, generally, any power greater than 2 into two powers with the same exponent; 
I have found a truly remarkable proof of this fact but the margins here are too 
narrow for it.” 

The collection of methods developed by Diophantus is still valuable. It is known 
under the name of the method of secants. This method allows one to completely 
investigate any quadratic equation and serves as a prototype for the study of cubic 
equations. 


§5.1. Diophantus’ method of secants. 

Second degree diophantine equations 

Before we consider the general situation, let us illustrate the method of secants 
with a concrete example, one of those that Diophantus considers in his Arithmetics. 
Given the equation 

(1.1) x 2 —y 2 ~ 1, 

we have to find all its rational solutions. Equation (1.1) singles out a hyperbola 
(Figure 38) in the .x;y-plane. 



Figure 38 
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We immediately see that (1,0), i.e., the intersection point P of the curve with 
the T-axis, is a solution. Let us draw the secant 

( 1 . 2 ) y = k{x- 1 ) 


through this point and find the second intersection point of the secant with the 
curve (1.1). To do so, substitute (1.2) into (1.1) and solve the quadratic equation 
obtained for x. As a result we get 

—k 2 ± 1 


The root X\ — 1 is known to us. It corresponds to the point (1, 0); the second root 
X 2 corresponds to the second point required, 


(1.3) 


/ k 2 + l 2k \ 
yA ; 2 — 1 ’ k 2 — 1 / 


For any rational k 4 ±1 this formula determines a point on our curve; hence, 
it determines a rational solution of the given equation; for A; = ±1 the secant 
intersects the curve at point P only (see Figure 38). Conversely, for any rational 
solution, i.e., a rational point M on the curve, the secant PM is given by equation 
( 1 . 2 ) with a rational k (because the legs of the acute triangle PMH are rational). 
Thus, formula (1.3) for all rational k ^ ±1 gives all solutions of equation (1.1) in 
rational numbers. 

This method is applicable not only to the polynomial x 2 — y 2 — 1 but to any 
second degree polynomial of two variables 


p(x, y) = Ax 2 + 2 Bxy + Cy 2 + Dx + Ey + F 


with integer or rational coefficients provided the curve p(x,y ) = 0 has at least one 
rational point Pq — (xq, y 0 ). Indeed, let us draw the line 

(1*4) y-y 0 = k(x - x 0 ) . 

through this point and find the intersection points of the curve p(x,y) = 0 with 
this line. That is, substitute (1.4) into the equation p(x,y) = 0. The polynomial 

p{x,yo + k(x - t 0 )) 

is of degree 2 with respect to a;; if we set 

p(x, yo + k(x - T 0 )) = r(k)x 2 + s(k)x + t(k), 
where r(k), s(k), t(k ) are polynomials of k, then for x we get an equation 

r(k)x 2 + s(k)x + t(k ) = 0 . 


We know one of the roots of this equation, namely, x = xq. Therefore, the other 
root, x ~ Ti, can be found using the relation 


xo + Xi = 


s ( k ) 

r(k)' 


in (1.4) we get 


Substituting the expression X\ = —x 0 
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Thus, the coordinates of the second point are of the form 

(i.s) (*!,»,)=(— vo-k (2x0 + jH)). 

Since s(k ) and r(k) are polynomials with rational coefficients, it follows that for any 
rational k such that r(k') / 0 formula (1.5) gives a rational solution of the equation 
p(x, y) — 0 and, since for rational x\ and y-\ the slope coefficient 

k _ vi ~yo 

X\-XQ 

is also rational, we deduce that this formula gives all rational solutions, provided 
we know at least one rational solution. 

The problem of existence of a rational point on a second degree curve p(x, y) — 0 
is quite tough. It is not always the case that such points exist; for instance, there 
are none on the circle x 2 + y 2 = 3 or on the ellipse x 2 + 82 y 2 = 3. If the curve 
p(x, y) = 0 is reducible over Q, then the problem of finding a rational point can 
be reduced to the study of a linear diophantine equation and, therefore, we can 
confine ourselves to the study of curves p(x, y) — 0 , where the polynomial p(x, y) — 
Ax 2 + 2 Bxy + Cy 2 + Dx + Ey + F is irreducible over Q. It is well known that if the 
discriminant A = AC — B 2 of the quadratic form Ax 2 + 2 Bxy + Cy 2 vanishes, then 
the polynomial p(x , y) can be reduced, by an invertible linear transformation with 
rational coefficients, to the form ax 2 + y , where a ^ 0 , or to the form x 2 — c, where 
c is not a perfect square. If A A 0, then the polynomial p(x, y) can be reduced to 
the form 

( 1 . 6 ) ax 2 + by 2 + c, ab ^ 0 . 

Obviously, the curve x 2 — c = 0 has no rational points; it is also obvious that 
on the curve ax 2 + y — 0 there is a rational point (0,0). Thus, it remains to study 
the curve (1.6). If c — 0, then (0,0) is a rational point of the curve (1.6). Hence, we 
may assume that abc A- 0. First of all, for equation (1.6) to be solvable in rational 
numbers, it is necessary that not all of the coefficients a, b, and c be of the same 
sign. Performing, if necessary, the change of variables 

x H—> x~ x , y t—» yx~ l or x 1 —* xy ~ x , y 1 —* y~ l , 

we can reduce the equation ( 1 . 6 ) to the form 

(1.7) ax 2 + by 2 — c = 0, where a>0,6>0,c>0. 

Here we can assume that a, 6 , and c are relatively prime and square-free integers. 

If x = p/r, y — q/r, where p,q,r € Z, is a rational solution of equation (1.7), 
then the equation 

(1.8) ax 2 + by 2 — cz 2 — 0 

has a nonzero integer solution {x,y,z} = {p,q,r}. Conversely, if equation (1.8) 
has a nontrivial integer solution {x,y, z}, then z / 0 (otherwise we would have 
obtained a nontrivial solution of the equation ax 2 + by 2 = 0 , where a > 0 , b > 0 ) 
and equation (1.7) is solvable in rational numbers. 

Therefore, provided that the above necessary condition is satisfied, the problem 
of existence of rational solutions of equation ( 1 . 6 ) can be reduced to the problem on 
nontrivial solvability in integers of equation (1.8). The numbers o, b, and c can be 
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assumed to be not only relatively prime, but also pairwise relatively prime. Indeed, 
let, for example, a and b be divisible by p. Then 

p(ax 2 + by 2 — cz 2 ) = a\x\ + b\y\ — c\z 2 , 

where X\ — px , y\ — py , a\ — a/p , bi — b/p, c\ — pc. 

Already Legendre elaborated a criterion for the existence of a nontrivial so¬ 
lution of equation (1.8). It goes as follows. 

5.1.1. Theorem. If a, b and c are pairwise relatively prime and square-free 
natural numbers, then the equation 

ax 2 + by 2 — cz 2 = 0 

has a nontrivial integer solution if and only if all three of the following congruences 
are solvable: 

x 2 — be = 0 (mod a), 
x 2 — ac = 0 (mod b ), 
x 2 + ab = 0 (mod c). 

It will be more convenient to pass from equation (1.8) to the equation 

acx 2 + bey 2 = z \. 

If (x, y, Z\) is a solution of this equation, then z\ is divisible by c since c is square- 
free. Therefore, (x, y, z\/c) is a solution of equation (1.8). Thus, instead of Theorem 
5.1.1 it suffices to prove the following statement. 

5.1.2. Theorem. Let a and b be square-free natural numbers. The equation 

(1.9) ax 2 +by 2 =z 2 

has a nontrivial integer solution if and only if there exist integers a, (3 and 7 such 
that ., 

(i) a 2 - a = 0 (mod b), 

(ii) (3 2 —b = 0 (mod a), 

(iii) q 2 + ab/h 2 = 0 (mod h), where h = (a, b). 

Proof. First, suppose that equation (1.9) has a nontrivial integer solution 
(x, y, z). We may assume that GCD(a:, y, z) = 1. It follows from (1.9) that ax 2 ~ z 2 
(mod b). Let GCD(&, x) = d. Then a; = dx 1 and b = db\\ hence, ad 2 x 2 +dbiy 2 — z 2 . 
Therefore, z = dz\ . Thus, b\y 2 is divisible by d, where d is square-free since b is. As 
a result, we see that all the numbers x, y, z are divisible by d, i.e., d—1. Therefore, 
there exists a number x' such that xx' = 1 (mod b). Then a — a 2 (mod 6 ), where 
a — x'z. The solvability of the congruence b — 0 2 (mod a) is similarly proved. 

The congruence (iii) can be rewritten in the form q 2 + a\b\ = 0 (mod h ), 
where a = hai, b = hb\. Here the numbers 01 , b\, h are pairwise relatively prime 
and square-free. Since a and b are divisible by h, it follows that z = hz\\ hence, 
aix 2 + biy 2 — hz 2 . If (x,h) — d, then y and z\ are divisible by d. Therefore, 
d — 1 and there exists a number x' such that xx' = 1 (mod h). By multiplying 
the congruence aix 2 -f b\y 2 = 0 (mod h) by bix 12 we get a\b\ + 7 s = 0 (mod h), 
where 7 = b\x'y. 

Now suppose that all the three congruences (i)-(iii) are solvable. If a = 1, then 
the theorem is obvious. Moreover, we may assume that a > b because if b > a 
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it suffices to interchange x and y. If a = 6 , then by (iii) we have q 2 + 1 = 0 
(mod a). Suppose that one of the prime divisors of a is of the form p = 4k + 3. 
Then q 2 + 1 = 0 (mod p) and 

(-y 2 ) 2 ^ = (— 1)^ = (—l) 2fc+1 = —1 (mod p). 

On the other hand, thanks to Fermat’s small theorem, (q 2 ) 2 ^ = q p-1 = 1 
(mod p). This leads to a contradiction; hence, a is the product of primes of the 
form 4k+ 1 and, perhaps, 2. Each prime factor of a can be, therefore, represented 
in the form of the sum of two squares (Lemma 4.4.4). Recall the formula 

(. x 2 + y 2 ){z 2 + 1 2 ) = (xz — yt ) 2 + (xt + yz) 2 . 


Hence, a = r 2 + s 2 for some integers r and s and there is a solution of equation 
(1.9) of the form 

{x,y,z} = {r, s,r 2 + s 2 }. 

The rest of the proof of Theorem 5.1.2 may be sketched as follows. If a > b > 1, 
then starting with equation (1.9) we construct a new equation Ax 2 +by 2 = z 2 , where 
0 < A < a and for this A the congruences similar to (i)-(iii) are solvable. After a 
finite number of steps and interchanging A with b if A < b we get either A = b or 
6 = 1. In each of these cases we have already shown that a solution exists. From 
this solution we will now construct a solution of equation (1.9). 

By (ii) there exists a number (4 such that (3 2 — b — a Ah 2 , where A is square-free. 
Here we may assume that \(3\ < a/2. First, let us show that 0 < A < a. Since 
(3 2 — aAk 2 + 6 < a(Ak 2 + 1), it follows that A > 0. Moreover, 6^1 and 6 is 
square-free. Hence, (3 2 ^ 6 and, therefore, 4/0. It is also clear that 


( 3 2 tf 2 




ak 2 


a 


^ a 

< ~ < a. 
4 


Let us now prove the solvability of the congruences required. Since j3 2 — b — 
aAk 2 , we have /3 2 = b (mod A); hence, the congruence similar to (ii) is solvable. 
Moreover, (3 = h(3\, where h — (a, 6 ); hence, 

(1.10) hf3 2 — bi — aiAk 2 . 


In particular, h/3 2 = a\ Ak 2 (mod 6 i). It follows from (i) that ai = ha 2 (mod 6 i). 
Hence, h(3 2 = hA(a\k ) 2 (mod 6 i). Since h, k and ai are relatively prime with 6 i, it 
follows that A = p 2 (mod &i). Moreover, it follows from (1.10) that a\Ak 2 = —b\ 
(mod h). Hence, A(aik ) 2 = — a\b\ = q 2 (mod h). Therefore, A = q 2 (mod h). 
The numbers h and b\ are relatively prime; hence, hu + biv = 1 for certain integers 
u and v. Consider a number x = hup + b\vq. Since hu = 1 (mod 6 i) and b\v — 1 
(mod h), it follows that x = p (mod 6 i) and x = q (mod h)\ hence, A = x 2 
(mod bi) and A ~ x 2 (mod h ). It follows that A — x 2 (mod 6 ). 

Now consider H = GCD(A, 6 ). Let A = HA 2 and 6 = Hb 2 - Then (3 = H(3 2 
and H(3 2 - b 2 — aA 2 k 2 . Hence, -A 2 6 2 = a(A 2 k ) 2 (mod H). It follows from (iii) 
that a~ a 2 (mod H). Hence, —A 2 b 2 = r 2 (mod H ). 

Now suppose that the equation AX 2 + bY 2 = Z 2 has a nontrivial integer 
solution. Let us multiply both sides of the equation AX 2 — Z 2 — bY 2 by aAk 2 = 
(3 2 - 6 term-wise. As a result, we get 

a(AXk ) 2 = ( Z 2 - bY 2 ){(3 2 - 6) = {(3Z + bY ) 2 - b{(3Y + Z ) 2 . 
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Hence, equation (1.9) has an integer solution x = AXk, y — (3Y + Z, z — , 8 Z + bY. 
This completes the proof, since 1^0. □ 

In 1950 the Canadian mathematician Holzer refined Legendre’s theorem as 
follows (cf. [C7]). Holzer showed that if equation (1.8) has a nontrivial root, then 
it possesses a nontrivial solution {x, y, z} such that 

|ai| < Vbc, \y\ < \fac, \z\ < Vab. 

This provides us with an effective algorithm for finding a nontrivial integer solution 
of equation ( 1 . 8 ) and, therefore, for finding a rational solution of equation ( 1 . 7 ). 

5.1.3. Theorem. Let a, b and c be pairwise relatively prime square-free num¬ 
bers such that a, b > 0 and c < 0. Then if ax 2 + by 2 -j-cz 2 = 0 has a nonzero integer 
solution, then it also has a nonzero integer solution for which 

x 2 < 6 |c|, y 2 < a|c|, z 2 < ab. 

Proof. (We follow Mordell [C13].) It suffices to prove that z 2 < ab. Indeed, 
then ax 2 + by 2 < ab\c\, hence, x 2 < 6 |c| and y 2 < a|cj. Consider a solution 
{xo,yo, zo} for which (x 0 ,y 0 ) — 1 and Zg > ab. It suffices to prove that with the 
help of this solution one can construct a new solution {x,y, z} for which \z\ < \zg\. 

The triple xo + tx i, yo + ty\, zq + tz\ is a solution if and only if the numbers 
t, X \, y \, z\ satisfy the relation 

(ax \ + by\ + cz\ )t 2 + 2 (ax 0 x 1 + by 0 y 1 + czoz^t = 0 . 

For t 0 we can divide this relation by t and as a result we get t = m/n, where 
m = —2(axQX\ + by^yi + czqZ\) and n — ax 2 + by 2 + cz 2 . After multiplication by 
the denominator n we get a solution 

xo n + xpm, yon + y\m, z 0 n + zpm. 

Let us show that these three numbers are divisible by A = (c,x\yo — yixo)- 
Indeed, by the hypothesis ( c,ab ) = 1 and (x 0 ,y 0 ) = 1 . Therefore, the identity 
ax o + byl + cz% = 0 implies that ( c,abx 0 y 0 ) = 1, hence, ( A,abx 0 y 0 ) = 1. In 

particular, it is meaningful to speak about Xq 1 and y 0 -1 modulo A, i.e., there exist 
integers Xq 1 and such that xqXq 1 = 1 (mod A) and yo^o " 1 = 1 (mod A). Since 
xiyo - yix 0 = 0 (mod A), it follows that x x = yixoy ^ 1 (mod A). Now it is easy 
to show that 

P = axoxi + 6 y 0 yi = 0 (mod A) and Q = ax\ + by\ = 0 (mod A). 
Indeed, 

P = ax 0 (y l xoyf l ) + 6y 0 yi = yi(aa;^ + byDy ^ 1 = 0 (mod A), 

Q = aiViXoVo 1 ) 2 + byf = (axl + by^yfy ^ 2 = 0 (mod A). 

Therefore, 

xqu + x\m = xq (ax 2 + by\ + cz\) - 2 xi(ax G xi + 6 y 0 yi + cz 0 zi) 

= xoQ + cxqzI - 2 x\P - 2 cx\ZqZi — 0 (mod A). 

We similarly prove that the other two numbers are divisible by A. 
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Let 8 be a divisor of A. Then the triple 

x = (xqU + x\m)8~ l , y = (yon + yirnjS^, z = (zqu + zim)8~ 1 

is an integer solution. We have to select this solution so as to satisfy the inequality 
\z\ < \zq\. Clearly, 


-<5 z (_ , ax qXi + by 0 yi 

■— ^1 T 

'Zq V CZo 


ax 0 x i + by 0 yi \ axl + by\ 


Taking into account that czg = —(axf } + by'g) we get 


( 1 . 11 ) 


ax 0 x i + by 0 yi 


+ -T2^Vo x i -XoVif- 


Now we are ready to construct the solution required. For aq and y\ take an 
arbitrary solution of the equation yoX\ — .x’o'i/i = 8 . Then only one condition is 
imposed on 8 , namely, 8 divides c. 

Case 1: c is even. Set 8 — |c and select z\ so that 


Then (1.11) yields 


ax 0*1 + byoy\ . 1 


1 z 1 ab 

- — < - H-o- 

2 Zq 4 4zg 


By the assumption, Zq > ab, hence, \z\ < \zo\. 

Case 2: c is odd. In this case we require the condition 


( 1 . 12 ) 


axi T byi + cz\ = 0 (mod 2). 


This condition corresponds to the choice of a definite parity of z\. Relation (1.12) 
is equivalent to 

n = ax\ + by\ + czf = 0 (mod 2). 

Therefore, the numbers xon+xim, yon+yim, zon+z\m are divisible by 28. Hence, 
if instead of 8 we take 8 ' = 28 it will also satisfy (1.11), which in this case takes the 
form 

—2 8z ( axoX\ + byoyi \ 2 ab , . 2 

- =[z 1 + -A-2-+ — 2 (yoX\ - xoyxf. 

CZo \ cz 0 J cz o 

Set 8 — c and select z\ of the parity required (i.e., such that relation (1.12) holds) 
and such that 

I , ax 0 xi + by 0 yi I ^ , 


Then 


< 1 + ~2 < 2 > 


i.e., \z\ < | so I- 
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Problems 

5.1.1. Which of the following equations have nontrivial integer solutions? 

a) 2>x 2 — by 2 + lz 2 — 0. 

b) lx 2 + 11 y 2 - 19z 2 = 0. 

c) Sx 2 — by 2 — 3 z 2 = 0. 

d) 11a; 2 - 3 y 2 - 41.z 2 = 0. 

5.1.2. Find rational points of the following curves: 

a) x 2 — 3 y 2 = 1; 

b) x 2 + 2 y 2 = 9; 

c) x 2 — by 2 = 1. 

5.1.3. Prove that the equation a; 2 — 2 y 2 — 3 has no integer solutions. 

5.1.4. Let d be a square-free natural number. 

a) Prove that if {x\,y{) is an integer solution of equation a; 2 — dy 2 = 1 and 
{x\ + yiVd) n = x n + y n Vd, where x n and y n are integers, then (x n ,y n ) is also a 
solution of this equation. 

b) Prove that if the equation a; 2 — dy 2 — —l has an integer solution, then the 
equation a: 2 — dy 2 = 1 also has an integer solution. 

c) Prove that equation x 2 — dy 2 — 1 has at least one integer solution. 

d) Prove that if the equation x 2 — dy 2 — n, where n^O, has at least one integer 
solution, then it has infinitely many of them. 

§5.2. Addition of points on a cubic curve 

In his Arithmetics Diophantus did not confine himself to the second degree 
equations. He succeeded in solving certain cubic equations as well and gave a 
general method for finding rational solutions of the equation 

2/(6 - y) = x 3 - x. 

However, a steady interest in third degree diophantine equations first appeared, it 
seems, only in connection with a problem from antiquity on congruent numbers; the 
study of these numbers was initiated by Arab mathematicians in the tenth century. 

A positive number r €E Q is called congruent if it is equal to the area of an acute 
triangle with sides of rational lengths. For example, 6 is the area of the triangle 
with sides 3, 4 and 5; hence, 6 is a congruent number. Let r be congruent and 
a, b, c € Q be the lengths of the sides of an acute triangle with area r. For any 
r E Q we can find s E Q such that s 2 r is a square-free integer. But the area of 
the triangle with sides sa, sb , and sc is equal to s 2 r. Therefore, without loss of 
generality, we may assume that r is a square-free positive integer. 

It is worth observing that in the definition of a congruent number the sides of 
the triangle can be rationals, not necessarily integers. Whereas 6 is the least possible 
area of the triangle with integer sides, it is possible to find an acute triangle of area 
5 with rational sides. Indeed, the triangle with sides §> X’ T has suc ^ an a rea. 
One can prove that 5 is the smallest congruent integer. 

It turns out that the problem of description of all congruent integers can be 
reduced to a third degree diophantine equation. Namely, the following statement 
holds. 

5.2.1. Theorem. Let n be a square-free natural number. Then the following 
three conditions are equivalent: 
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(1) n is a congruent number ; 

(2) there exists a rational number x such that the numbers x, x + n, and x~n 
are squares of rational numbers ; 

(3) on the curve y 2 — x 3 — n 2 x, there exists a rational point (x, y) whose 
coordinate x is the square of a rational number such that the denominator of x is 
an even number and the numerator of x has no common divisors with n. 

PROOF. Let a < b < c be a triple of positive rational numbers such that 
a 2 +b 2 = c 2 and n — \ab . Set x = \c 2 . Then x+n = \{a+b) 2 and x~n — \{a—b) 2 . 
Hence, x, x + n and x — n are the squares of rational numbers. On the other hand, 
if x satisfies condition (2), then set c = ‘l\fx and find a and b from the system of 
equations 

( (a + b ) 2 = 4(x + n), 

[ (a — fe) 2 = 4{x — n). 

In other words, if x satisfies condition (2), then the desired triangle has sides of 
lengths 

a = y/x + n - y/x — n, b = y/x + n + y/x - n, c = 2 y/x. 

In order to prove the equivalence of (2) and (3), let us consider a rational 
number x such that x,x + n and x — n are squares of rational numbers. Then 
x — u 2 and (x + n)(x — n) = u A — n 2 — v 2 . Set y — uv. Then 

(2.1) y 2 — x z — n 2 x, 

i.e., the point (x,y) belongs to the cubic (2.1). Since x = Jc 2 , the denominator of 
x is divisible by 2. Moreover, it is clear that the numerator of x has no common 
divisors with n. 

Conversely, if x = u 2 = {c/2) 2 and x 3 - n 2 x = y 2 , then 
v 2 — y 2 /x — x 2 — n 2 — (x + n)(x — n) 

and we have a Pythagorean triple v 2 + n 2 = x 2 . The numbers x 2 and v 2 = x 2 — n 2 
have the same denominator q 4 and the number q is even by the assumption. Hence, 
the numbers q 2 v and q 2 x are integers and q 2 n is an even number such that q 2 x and 
q 2 n have no common divisors and 

(q 2 v) 2 + {q 2 n) 2 = ( q 2 x) 2 . 

Hence, q 2 v = s 2 -t 2 , q 2 n = 2 st and q 2 x = s 2 +t 2 , where s and t are integers. Since 

(2 s/q) 2 + (2 t/q) 2 = 4x — (2u) 2 , 

the triangle with sides 2 s/q, 2 t/q and 2 u is a right one and its area is equal to 
2 st/q 2 = q 2 n/q 2 — n. 

Theorem 5.2.1 is completely proved. d 

The problem on congruent numbers and also certain other classical problems, 
for example, the problem of finding rational solutions of the equation x 3 + y 3 = 1, 
are particular cases of the problem of finding rational solutions of the general cubic 
equation f(x,y) = 0 for two unknowns, i.e., of finding rational points on the curve 
C defined by the equation f{x,y) =0. 

First, suppose that the cubic f{x,y) = 0 has a singular point O and that 
this point is rational. Any line passing through a singular point intersects the 
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curve with multiplicity at least two. This implies, in particular, that a cubic curve 
cannot have two singular points, O and Oi, because otherwise the straight line 00 \ 
would intersect the cubic with multiplicity at least 4. 

Let us draw a rational straight line x — xo+at, y — yo+bt , a, b <E Q, through the 
point O = (.To, 2/0 )• The intersection points of this line with the cubic correspond 
to the roots of the polynomial F(t) = f(x o + at,y 0 + bt ). The coefficients of F 
are rational and for almost all lines the degree of F is equal to 3 (the degree is 
smaller than 3 only for the lines passing through infinite points of the curve). The 
polynomial F has the root t = 0 of multiplicity 2 corresponding to point O. Hence, 
the third root of F is rational, i.e., it corresponds to a rational point on the curve. 
It is also clear that the straight line that connects O with a rational point on the 
curve is rational. This gives a complete description of the set of rational points of 
a singular cubic curve. 

In what follows we will only consider nonsingular cubic curves. Recall that in 
Chapter 1 we defined addition of points on a nonsingular cubic curves as follows. 
Let E be a fixed point on the given curve, A and B points on the curve and X the 
intersection point of the straight line AB with the curve. The sum of points A and 
B is the intersection point of the straight line EX with the given curve. It is easy 
to verify that if the cubic curve is given by a polynomial with integer coefficients 
and the point E is rational, then the sum of two rational points is a rational point. 

The curve y 2 = x 3 + ax 2 + b is nonsingular if and only if its discriminant 
A = —(4a 3 + 27b 2 ) is nonzero. If we take the infinite point as the zero element, 
then for such a curve it is easy to get explicit formulas for addition of points. Let 
the straight line y = px + q intersect the given curve at points (xi,y t ), i = 1, 2, 3. 
Then 


(px + q) 2 — x 3 + ax + b 

for x — Xi , x 2 , x 3 . The sum of the roots of this equation is equal to p 2 ; hence, 


x 3 = —X\ - X 2 + P 2 = ~Xi - X2 + 


f Ki ~ 
\*1 -*2 


2 




y 3 = px + q - 


V 1 - 2/2 
x\ - #2 


(x 3 - x 1 ) + 3 / 1 . 


Clearly, the coordinates of the sum of the points (xi,yi) and (# 2 , 2 / 2 ) are (x 3 , — 3 / 3 )■ 
If #1 = #2 and y\ = 2 / 2 , it suffices to notice that 


lim 

X\ 


2/1 - 2/2 


X2 


= y'(x 1 ) = 


3x 2 + a 
22/1 


We should separately consider the case x\ — x 2 and y\ ^ y 2 - In this case the 
sum of points is the infinite point of the curve. 

The formulas obtained show that knowing one rational point P of the curve 
y 2 — x 3 + ax+ b we can find the rational points 2P, 3 P, etc. Consider, for example, 
the curve y 2 = x 3 — 2 and point P — (3,5). Then 


2 P = 


/ 129 


383 

1000 


is a new rational point. We can now compute 3 P, 4P, etc. With every step the 
volume of calculations steeply increases. If we denote by x n the first coordinate of 
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the point nP, then 

129 164323 

Xj-Z, X 2 - ^3 - 29241 > 

2340922881 ^ _ 3073261 05747363 

Xa = 58675600 ’ 2:5 _ 160280942564521' 

The numerator of Xu has 71 digits. 

It is worth observing that the points P, 2 P, 3 P, etc. are not necessarily distinct. 
If some of them coincide, then the least number m € Q for which rnP is the zero 
element of the group, i.e., the infinite point, is called the order of the point P. 


Problems 

5.2.1. Prove that the point P = (0,2) on the curve y 2 = x 3 + 4 is of order 3. 

5.2.2. Prove that the point P = (2,4) on the curve y 2 = x 3 + 4x is of order 4. 

Each of the points in Problems 5.2.3 - 5.2.9 is of finite order on the correspond¬ 
ing curve. Find this order. 

5.2.3. P = (0,4) on y 2 = 4a; 3 + 16. 

5.2.4. P = (2,8) on y 2 = 4a; 3 + 16a;. 

5.2.5. P= (2,3) ont/ 2 = a; 3 + 1. 

5.2.6. P = (3,8) on y 2 = a; 3 — 43a; + 166. 

5.2.7. P — (3,12) on y 2 — x 3 - 14a: 2 + 81a:. 

5.2.8. P — (0,0) on y 2 + y — x 3 - x 2 . 

5.2.9. P = (1,0) on y 2 + xy + y — x 3 - a; 2 - 3a; + 3. 

5.2.10. Let f n be the functions on the curve 

y 2 = x 3 + ax + b , where 4a 3 + 27 b 2 ^ 0, 
determined by the relations 

fi = h 
/2 = 2 y, 

/ 3 = 3a; 4 + 6aa; 2 + 126a; — o 2 , 

u = 4 y{x 6 + 5ax 4 + 206a; 3 - 5a 2 a; 2 - 4a6a; - 86 2 - a 3 ), 

/2m = 2/ m (/m+2/m-l ~ /m+2/m+l) for 171 ^ 3 ’ 

/2m+l = fm+2fl. ~ fm-lfm+1 for m > 2. 

Further, let 

9n ~ %fn fn—lfn+1 and 4 yh n fn+2fn — l fn—lfn+1' 


Prove that 


9n hn 
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§5.3. Several examples 

In this section we consider several examples of nonsingular cubic curves deter¬ 
mined by the equation in the normal form 

y 2 = x 3 + ax + b 

or by the equation 

y 2 + 2 cy — x 3 + ax + b 

which can be reduced to the preceding one by the change of variables y i-> y + c. 

We will be mainly interested in the set of rational points on the curve E\ it will 
be denoted by E(Q). This set, as we have already said, is an abelian group whose 
zero element is the infinite point on the curve. This is why it is also convenient to 
consider the infinite point of the curve as a rational point. The first three examples 
are taken from [BIO]. 

Example 1. Consider the following problem: Represent the product of two 
consecutive integers y(y + 1) in the form of the product of three consecutive integers 
(;x — l)x(x + 1) = x 3 — x. This problem leads to the curve E determined by the 
equation 

(3.1) y 2 + y = x 3 - x. 

On this curve, there are six obvious points with integer coordinates (cf. Figure 39): 
(0,0), (1,0), (-1,0), (0,-1), (1,-1), and (-1,-1). 



Figure 39 
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Set P = (0,0). Then all the points indicated are generated by the point P : 

(1,0) = 2 P, (-1,0) - -3 P, (0, -1) = —P, (1, -1) = —2P, (-1, -1) = 3 P. 

The point P actually generates an infinite cyclic group. 

All the points of the form (2n + 1 )P belong to the closed component of the 
curve containing P; the points of the form 2 nP lie on the noncompact component 
and tend to infinity as n grows. 

Example 2. The curve E given by equation 

(3.2) y 2 + y = x 3 - x 2 

has four obvious points with integer coordinates (cf. Figure 40): 

(1,0), (0,0), (0,-1) = -(0,0), and (1,-1) = -(1,0). 

The tangent to the curve E at (1,0) intersects E at the point (0,-1); this means 
that 2(1,0) = (0,0); hence, 2(1,-1) = (0,-1). 



The tangent to E at the point (0,0) intersects E at the point (1,0); this means 
that 2(0,0) = (1,-1). The equations 2(1,0) = (0,0), 2(0,0) = (1,-1) = —(1)0) 
imply that 4(1,0) — (1,-1) = —(1,0), i.e., 5(1,0) = 0. Hence, the subset 

{ 0 , ( 1 , 0 ), ( 0 , 0 ), ( 0 ,- 1 ), ( 1 ,- 1 )} 

is a cyclic subgroup of order 5 in E( Q). Using a more advanced technique one can 
show that there are no other rational points on this curve. 

Example 3. Consider the curve E given by equation 
(3.3) y 2 + y = x 3 + x 2 . 

This curve has four obvious points with integer coordinates (cf. Figure 41): 

(0,0), (-1,0), (0,-1) = -(0,0) and (-1,-1) = -(-1,0). 
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Figure 41 


The point P — (0,0) generates an infinite cyclic subgroup in E( Q). ; 

For instance, it is not difficult to calculate that 

2P = (—1, —1), -3P = (1,1), 3P = (1, —2), 4P=(2,3), 5P = (-j,-?) . 

The tangent T to the curve at the point —2 P intersects the curve at the point AP 
and the line L passing through 2 P and P intersects E at the point —3P; the point 
—5 P is constructed either using the straight line L\ through P and 4P or using the 
straight line L 2 through 2 P and 3 P. 

Example 4. The curve E of this example is given by Fermat’s equation 
(3.4) u 3 +v 3 = w 3 . 


The triples 

(u,v,w) = (1, —1,0), (1,0,1) and (0,1,1) 
are nonzero integer solutions of this equation. All the other integer solutions are 
proportional to these solutions. The cubic equation (3.4) is not given in its normal 
form; however, the transformation 


3w 

u + v’ 



u — v 



(3.5) 


U + V 
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reduces E, or rather, its equation, to the normal form: 

(3.6) y 2 - y - x 3 - 7. 

The transformation (3.5) sends the point (1, —1,0) into the infinite point; the point 
(1, 0, 1) into (3, 5) and the point (0, 1, 1) into (3, -4). Thus, E( Q) is a cyclic group 
consisting of three elements {0, (3,5), (3, —4)} because if E(Q) had contained other 
rational points (x,y), then there would have existed nontrivial solutions (u , v, w) of 
Fermat’s equation u 3 + v 3 = w 3 corresponding to them. 

Let us establish that 3P = 0 for P = (3,5) by computing —2P using the 
tangent to E at point P. From the equation for the tangent 

(2 y - 1 )y' - Sx 2 

we find the coefficient of the slope of the tangent to E at the point (3, 5): 

A _ „27_ 

2y - 1 9 

Hence, the tangent is given by the equation 

y = 5 + 3(x - 3) = 3a: - 4. 

The tangent intersects the curve (3.6) at the points whose abscissas satisfy the 
equation 

{Sx - 4) 2 - (3® - 4) = x 3 - 7, 


x 3 - 9x 2 + 27x -27 = {x- 3) 3 = 0. 

Thus, the abscissa of the point -2P is x - 3. Hence, -2 P = P, and so we have 
3P = 0. 

Example 5 . Consider the curve 


y 2 — X 3 + k, 


where k is an integer. 

The corresponding diophantine equation was first considered in the seventeenth 
century by Fermat and Bashe de Mesiriac in the particular case k — —2 and 
was later the subject of intensive study. It is as yet unknown for which integers 
k equation (3.7) has at least one rational solution. Bashe claimed (without proof) 
that if a rational solution (x, y) with xy ^ 0 exists, then the method of tangents 
leads to an infinite number of rational solutions. 

In modern terms this means that if the group E(Q) of rational points of the 
curve (3.7) is nonzero, then it is infinite. With certain restrictions this statement 
was proved in 1930 by the German mathematician Fueter. In 1966 a remarkably 
short proof of Fueter’s result was discovered by the English mathematician Mordell 
(see Problem 5.3.2 below). 
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Problems 


5.3.1. Let P 0 — (xo,y 0 ), where y 0 / 0, be a rational point of the curve 

y 2 ~ X 3 + k. 


Show that the tangent at Po intersects this curve at the point Pi — (aq, yi), where 

_ 9^o - y 2 __ 27xq - 36xgyo + 8y$ 

4 yl ’ Vl ~ 8^ ' 

In particular, show that the curve y 2 — x 3 - 2 has rational points P 0 = (3,5) and 
p — f JJ9 383 \ 

1 vioo ’ ioooL 

5.3.2. (Mordell, [C12]) Let k be an integer free of 6th powers and distinct from 
1 and —432. Further, let the curve 

y 2 = x 3 + k 


have a rational point Po = (xo, yo), where xoyo / 0. 

a) Set xq = ^ and y 0 = where (y, q, r ) = 1 , (y, q) — 1 and (q, r) = 1 . Show 
that the coordinate X\ of the point Pi = (xi,yi) from Problem 5.3.1 is given by 
the relation 


q 2 xi 


9p 4 
4 r 2 


- 2 p. 


b) Prove that if ^ is not an integer, then Pi / P 0 . 

c) Prove that if is an integer, then, applying the process indicated in the 
preceding problem to the point Pi, we get a rational point P[ 7 ^ Po- 

d) Using the results of a)-c) prove that the curve y 2 — x 3 + k has infinitely 
many rational points. 

5.3.3. a) Prove that the equation y 2 = x 3 + 1 has no rational solutions distinct 
from (-1,0), (0, ±1), (2, ±3). 

b) Prove that equation y 2 = x 3 — 432 has no rational solutions distinct from 
the solution (12, ±36). 


§5.4. Mordell’s theorem 

In 1901 the great French mathematician A. Poincare conjectured [A13] that 
all rational points P(Q) of an elliptic curve can be obtained as sums of a finite 
number of points. In algebraic terms, this statement can be formulated as follows. 

5.4.1. Theorem. The group E(Q) of rational points of any rational elliptic 
curve E is a finitely generated abelian group. 

Poincare himself considered this statement obvious. In 1922 the English math¬ 
ematician Mordell [Cll] obtained the first rigorous proof of Poincare’s conjecture. 
During the following seven decades various generalizations and new variants of the 
proof of this theorem appeared. We will give one of them here. Our proof has 
a small gap; in order to fill in the gap we need the machinery of the theory of 
divisibility in rings of integer algebraic numbers; this goes beyond the scope of our 
book (see [A13]). 

Consider the curve 


( 4 . 0 ) 


y 2 — x 3 + ax + b, 
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where a, b e Q and A = -(4a 3 + 27 b 2 ) ^ 0. We may assume that a and b are not 
just rational but integer numbers. Indeed, let 


a 


P 

5 

q 


r 

q 


(we do not suppose that the fractions are irreducible). Set 


V 


m 


X 


xi 

q 2 ' 


Then the initial equation takes the form (with integer coefficients in the right-hand 
side): 


yi 


x\ +pq 3 X\ + rq 5 . 


Let ai, « 2 ? be the roots of equation (4.0). Since by assumption a, b £ Z, it 
follows that an, 02 ? «3 are integer algebraic numbers. Thus, 


(4.1) 


r 


(x - oi)(x - o 2 )(x - o 3 ). 


It suffices to keep track of the coordinate x only since two values ±r/o correspond 
to every value Xq. Let us consider, for example, the elliptic curve y 2 — x 3 — 2. The 
point Po = (3,5) lies on this curve. In the preceding section we have shown that 

/129 


2Pn 


100 ’ 1000 


383 \ 


Moreover, 


X\ — Oi 


129 9 ~ 6 ^- 2^4 

100 v ^ 10 


i.e., the point 2 P 0 — (x\,yi) has the following property: the number aq - «i is a 
perfect square in the field Q(v / 2). This property plays a decisive role in the proof 
of Mordell’s theorem. Let us show that if E is a rational elliptic curve (4.1) and a 
is one of the integer algebraic numbers cci, aq, aq? then the map E(Q) —* Q(«) 
that sends the point (xo,yo) € E(Q) into x 0 - a 6 Q(a), can be included in the 
commutative diagram (the proof of commutativity will be given shortly) 


E(Q) -lQ(a) 


E(Q)/2E(Q) — ^Q(a)/Q(a) 2 

in which the addition of points in E(Q) corresponds to the multiplication of cosets 
in Q(a)/Q(o:) 2 . 

The fact that the image of the set E(Q) in Q(a)/Q(o:) 2 is finite (we will not 
prove this fact) is of extreme importance. 

Let us first obtain explicit formulas for addition of points on the curve y 2 = 
x 3 +ax+b in the form most convenient for our goals. Let the straight line y = px+q 
intersect the curve y 2 — x 3 + ax + b at three points (aq, yi), i = 1,2,3. Then 

x 3 + ax + b — (px + q) — (x — Xi)(x — X2)(x — X3). 
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In particular, if x = a is one of the roots of the polynomial a ? 3 + ax + b, then taking 
into account that a 3 + aa + b = 0 , we get 

(aii - ot)(x 2 - a)(a ?3 - a) — (pa + q ) 2 . 


It is also clear that 

Therefore, 

(4.2) x 3 ~a 


pa + q 


Vi(x 2 - a) - y 2 (x 1 - a) 


x 2 - x x 


1 


(yi(x 2 -a) -y 2 (x! - a)' 


(xi -a)(x 2 ~a) V 
If X\ = x 2 (i.e., for the tangent ) we get 

3a ? 2 + a 


x 2 - X\ 


P 


2yi 


Hence, 

(4.3) 

x 3 - a 


1 ( 3a ? 2 + a. \ 2 / 1 ( 2 2 y? 

+a+ ^r. 


If we make use of the relations 


vl = (xi - a)(x i - 0 ! 2 )(a?i - a 3 ), 
a + a 2 + a 3 — 0, cca^as = ~b, a 3 + aa + b — 0, 


then (4.3) can be reduced to the form 


(4.4) 


a ?3 — a 


' 3a ? 2 — a — 2ax\ — 2a 2 
2 yi 


Now we are ready to prove the commutativity of the above diagram. In the 
group Q(a)/Q(a ) 2 the order of any nonunit elements is equal to 2. Therefore, 
formula (4.2) shows that if the classes A,Bg Q(a)/Q(a ) 2 correspond to the points 
P and Q, then the class AB corresponds to the point P + Q. Formula (4.4) implies 
that the unit class E corresponds to the point 2P. 

This statement can be inverted: if the point Q corresponds to the unit class, 
then Q = 2P. In proving this fact we will confine ourselves to the case when the 
elements 1, a and a 2 are linearly independent over Q. Let 


x — a = (uo + u\a + u 2 a 2 ) 2 

= «o + 2 uoWiQ: + (uf + 2wo'Ui)a 2 + 2 iiiU 20 ; 3 + u 2 of 
= (u 2 — 2buiu 2 ) + (2uoiii — 2au\u 2 — bu\) + («i + 2 u 0 u 2 — au 2 )a 2 . 

Then 

(i) Uq — 2bu\u 2 = x, 

(ii) 2iioWi — 2au\u 2 — bu \ = — 1, 

(iii) ui + 2 uqu 2 — au 2 = 0. 

Let us multiply (ii) and (iii) by — u 2 and ui , respectively, and add the equations 
obtained. We get 

uf + auiu 2 + bu\ = u 2 . 
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Since w 2 A 0, it follows that 


J_ = ('M +a ^ + bt 
U 2 \ w 2 ) w 2 

'IL'\ 1 

i.e., the point P = (x\ y'), where x' — —- and y' — —, lies on the curve considered. 
v U2 u 2 

Let us prove that Q — 2P. Relation (iii) implies that 

x' 2 + 2uoy' — a — 0, 


„/2 


Ql _ fjQ' a 1 3T 

hence, Uo = . —. Moreover, w 2 = tut, u\ — x'u 2 = Therefore, 

2y' 2 y f y 


uq + uia + U2<y. 


a — x 12 x' 


_ 1 o 

—— H-rCt H—:a 

2 y' yf y' 


X 


/ 2 


a — 2a:r / — 2a 2 


2 y' 


By (4.4), this implies that Q = 2 P. Therefore, if the points P and Q correspond 
to one class, then P + Q corresponds to the class E and, therefore, there exists a 
point R such that P + Q - 2R. This completes the proof of commutativity of our 
diagram. 

As we have noted above, one can prove that the image of the group E(Q) in 
Q(a)/Q(o;) 2 is finite. (We skip the proof in order to avoid a rather long excursion 
into the theory of divisibility in rings of integer algebraic numbers.) Therefore, the 
rational points of our curve belong to a finite number of classes from Q(«)/Q(«) 2 ; 
let the points Qi ,..., Q m be representatives of these classes. Let us take an arbi¬ 
trary rational point Pq on the curve. This point belongs to the same class as one 
of the points Qi x ; hence, Pq 4- Q tl = 2P\. Similarly, P\ + Q t2 = 2 P 2 . Therefore, 


To + Qi! + 2 Qi 2 — 2(P\ + Qi 2 ) — 4P 2 . 


By continuing similar arguments we find that 

(4.5) P 0 + Qn + 2Q i2 + • • • +.2 k Q ik+1 = 2 k+1 P k+1 . 

In what follows we will show that for any point Pq after finitely many steps we get 
a point Pk+iy for which the numerators and denominators of its coordinates are 
bounded by a constant C independent of Pq. There are finitely many such points 
and, therefore, any point Pq belongs to the group generated by the union of the 
points Qi ,..., Qm with a finite set of points. This is precisely the statement of 
Mordell’s theorem. 

Thus, it remains to prove that the procedure (4.5) leads to a point Pk+i of the 
type desired. 

We can represent any point (Xo,yo ) on the curve y 2 — x 3 + ax + b in the 
form (Jr, jg), where Jr and are irreducible fractions. Indeed, let Xq — ^ be an 
irreducible fraction. Then 

(y 0 g 2 ) 2 = q(p 3 + apq 2 + bq 3 ) = qr, 

where (q,r) — (g,p 3 ) = 1 . Therefore, q — s 2 , r = t 2 and y^s 8 — s 2 t 2 , i.e., y = js- 
Since (p, q) — 1 and (q, r ) = 1, it follows that (p, s) — 1 and (t, s) — 1. 

Let Ao = max(|p|,s 2 ). Then t 2 < Ag(l + \a\ + |bj), i.e., |t| < ciA^ 2 . Thus, for 
the point Pq = (xq, yo) we have determined the number Aq. 
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Similarly, let us determine the numbers p, Ai and u for the points Qp , P\ and 
2Pi, respectively. We would like to estimate Ai in terms of A 0 . First, let us estimate 
u. For convenience, designate the coordinates of the points as follows: 



Here the numbers p , q, r can be assumed to be bounded by a constant (the numbers 
s, t, u are only used as an intermediary result of computations; the final result is 
the triple x 1} y 1} z\). 

The formula for addition of points on the cubic curve can be transformed to 
the following form: 


x 3 = -xi - x 2 + 


2 

yi-V 2 \ {x\x 2 + a) (xi + x 2 ) + 2b - 2y 1 y 2 


Xi ~x 2 J 

Therefore, from the equation P 0 + Q- tl 


(*i - x 2 y 
2 Pi we derive 


u 


s (px + ar 2 z 2 ) (r 2 x + pz 2 ) + 2br 4 z 4 — 2qryz 

( r 2 x — pz 2 ) 2 


The orders of x and z 2 are both equal to Ap, the order of z 4 is equal to Aq, and the 
order of yz is equal to Aq^A ^/ 2 = Aq; hence, u <c 2 Aq. 

Now we show that A x < < c 4 X^ 2 . Then A 2 < c^cj 2 )^/ 4 and, therefore, 

A n < c\(/ 2 , where c = max (l,c|). Hence, for any Ao and for sufficiently large n 
the number A n is bounded by a constant C. It is possible to express formula (4.4) 
in the form 




i.e., 


(s — au 2 ) 1 / 2 — _ ■ “ ■ (x\ — azf — 2ax\z\ — 2 a 2 zf) — eo + e\a + e 2 a 2 , 


s \i /2 
-a) = 


u 


Xt „ X\ 

-4 — a — 2a—^ 


2a* 


~3 

z i 

2 yi 


2yi«i 


where 


eo 


u 


(x\ - azf ), e x 


UXiZ 1 


e 2 


uzf 


tyizi ' * yi y\ 

The expression on the left-hand side is an algebraic integer; hence, eo + e x a + e 2 a 2 
is also an algebraic integer. 

5.4.2. Lemma. Let eo + eia + e 2 a 2 , where e x € Q and a 3 + aa + b — 0, be an 
integer algebraic number. Let A be the discriminant of the polynomial x 3 + ax + b. 
Then Ae* 6 Z. 

PROOF. Let aq, 0 : 2,013 be the roots of the polynomial x 3 + ax + b, and ny, 
<j 2 , 03 be the automorphisms of C that send a into a%, a 2 , 0 : 3 , respectively. For 
7 € Q(a) define its trace , tr( 7 ), by setting 

tr(y) = o-i ( 7 ) + (J 2 ( 7 ) + <*3 ( 7 ) • 

Then tr( 7 ) € Q and if 7 is an algebraic integer, then tr( 7 ) € Z. 
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Let us multiply the equation 7 = Bq + e x a + e 2 Ct 2 by a and a 2 and take the 
traces. We get: 


tr(y) = 3 e 0 + e x tr(a) + e 2 tr(a 2 ), 
tr(ay) = eo tr(a) + e x tr(a 2 ) + tr(a 3 ), 

tr(a 2 y) = eotr(a 2 ) + e x tr(a 3 ) + e 2 tr(a 4 ). 

These three equations can be considered as equations for eo, e X) e 2 (with integer 
coefficients). The determinant of this system is equal to 

3 01+0:2+0:3 a 2 + a 2 + 0^3 

01+0:2 + 03 a 2 + a 2 + o| of + o| + of 

of + o| + o§ of + of + o l of + of + of 


1 

1 

1 


1 

Oil 

Oi\ 

(*1 

Ol 2 

a 3 

• 

1 

Oi 2 

a\ 

a\ 

a\ 

<4 


1 


Oil 


= (01 - O 2) 2 (01 - o 3 ) 2 (02 - o : 3 ) 2 = A. 


Therefore, Lemma 5.4.2 is proved. U 

Thus, the numbers A(2e 0 - ae 2 ) = ^x\ and Ae 2 = are integers. 

Since the fractions % and % are irreducible, it follows that ~ is also an integer. 

z-^ z-y yx't'x. 

Indeed, if the latter fraction were irreducible with the denominator divisible by y x 
(resp. z x ), then this factor could not cancel zf (resp. x 2 ). Therefore, both x 2 and 
z 1 are divisors of the numbers A( 2 eo - ae 2 ) and Ae 2 - 

Consider the equality (4.5) for the numbers 01 , 02 , and o 3 . We see that 

A(2eo — ae 2 ) and Ae 2 can be linearly expressed in terms of (s — o^-a 2 ) ' for i = 1 , 

1 /o 

2, 3 and, therefore, they are of order u 1 / 2 . Thus, we have shown that Ai < ciA 0 
and, in other words, we have established that the set of points Pk+i is finite, as 
required. ^ 

§5.5. The rank and the torsion group of an elliptic curve 

Consider an elliptic curve E determined over Q. By the results of the pre¬ 
ceding section the group E(Q) of the rational points of E is a finitely generated 
abelian group. As any finitely generated abelian group, E(Q) admits the following 
decomposition: 

E{Q) = Z rs x Tors E{Q), 

where te is the rank of E (Q) and Tors E (Q) is the subgroup of elements of finite 
order in E(Q). The number t j e is called the rank of the elliptic curve E and the 
subgroup Tors E (Q) the torsion group of this elliptic curve. It is possible to prove 
that the rank of an elliptic curve is preserved under birational transformations. 

The rank ve is calculated for many elliptic curves over Q. In the majority of 
cases it is rather small: most often it is equal to 0, 1, 2 or 3. As an illustration 
let us give the values of the ranks of the curves y 2 — x s + ax and y 2 = x 3 + a for 
several values of a (cf. [BIO]). 
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Table 1. The ranks of elliptic curves E given by equation y 2 = x 3 +ax. 


rank the values of a 

0 1,2,4,6,7,10,11,12,22, -1, -3, -4, -8, -9, -11, -13, -18, -19 

1 3,5,8,9,13,15,18,19,20, -2, -5, -6, -7, -10, -12, -14, -15, -20 

2 14,33,34,39,46,-17,-56,-65,-77 

3 -82 

Table 2. The ranks of elliptic curves E given by equation y 2 — x 3 +a. 


rank the values of a 

0 1,4,6,7,13,14,16,20,21, -1, -3, -5, - 6 , - 8 , -9, -10, -14, -432 

1 2,3,5,8,9,10,11,12,18, -2, -4, -7, -13, -15, -18, -19, -20, -21 

2 15,17,24,37,43,-11,-26,-39,-47 

3 113,141,316,346,359, -174, -307, -362 

It is yet unknown if there exist elliptic curves of arbitrarily large rank. In 1986 
the French mathematician Mestre constructed examples of elliptic curves of ranks 
3 to 14. For example, the curve 

y 2 + 9767 y = x 3 + 3576t 2 + 425x - 2412 

is of rank r E > 9, and the curve 

y 2 + 357573631^ = cc 3 + 2597055a; 2 - 549082a; - 19608054 

is of rank r E > 14. 

One of the most famous conjectures in modern number theory relates the num¬ 
ber r E to the order of the zero at s ~ 1 of the analytic function corresponding to 
E. This conjecture was made in 1965 by the English mathematicians Birch and 
Swinnerton-Dyer. In order to formulate it, we need some prerequisites. Let 

y 2 = x 3 + ax + b, A = -(4a 3 + 27 b 2 ) ^ 0 

be an elliptic curve. As we observed in the preceding section, we may assume 
without loss of generality that a and b are integers. Let p be a prime. Consider the 
congruence 

y 2 = x 3 + ax + b (mod p ), 
or, equivalently, the equation 

(5.1) y 2 — x 3 + ax + b, where a, b € Z/pZ = F p . 

If the prime p does not divide the discriminant A, then equation (5.1) determines 
an elliptic curve E p over ¥ p called the reduction of E modulo p. Denote by N p the 
number of points of E p with coordinates in ¥ p , the infinite point included. 

For example, the solutions of the equation y 2 = x 3 + Sx considered over F 5 are 

(0,0), (1,±2), (2, ±2), (3, ±1), (4, ±1), 00 ; 


hence, N 5 = 10. 
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What can one say about N p for an arbitrary curve y 2 = x 3 + ax + b over F p ? 
Any such curve contains at least the infinite point. On the other hand, each element 
x from F p produces not more than two values of y and, therefore, we see that N p 
does not exceed 2p + 1, counting the infinite point. Thus, we have the following 
obvious inequalities: 

1 < N p 2 p -T 1 • 

These inequalities can be rewritten as follows: 

\p+l — N p \ < p. 

In 1934 the German mathematician H. Hasse obtained a finer estimate [C 6 ]: 


\p + 1 — ATp| < 2^/p. 


Starting from the numbers N p , define the so-called L-function of a rational 
elliptic curve E by setting 



o v p 


n 

pX a 


1 - a p p~ s + p 


1-2 5 




where a p = p + 1 - N p and A is the discriminant of the given curve. 

From Hasse’s estimate it is easy to derive that the above infinite product con¬ 
verges for Res > |. Conjecturally the function L(E. s ) can be analytically contin¬ 
ued to the whole complex plane. 

5.5.1. Conjecture 1 (Hasse-Weil). For any rational elliptic curve E there 
exists a positive integer N and a sign e — ±1 such that the modified L-function 

A(E,s) = N s / 2 (27r)- s T(s)L(E,s), 
where T(s) is the Euler T-function, satisfies the functional equation 

A (E,s) - -sA(E,2 — s). 

In spite of titanic efforts of many first-rate mathematicians, until the summer 
of 1993 this conjecture was only proved in certain particular cases. In June of 
1993 a wonderful development occurred: Princeton University professor A. Wiles 
announced the proof of the Weil—Taniyama conjecture for semistable elliptic cuives. 
The Weil-Taniyama conjecture implies the Hasse-Weil conjecture. 

In this small book we cannot discuss Weil-Taniyama’s conjecture; we will only 
observe that it states that every elliptic curve can be parameterized in a special 
way by the so-called modular functions. One can judge how deep the properties of 
the numbers mentioned in Weil—Taniyama’s conjecture are from the fact that the 
conjecture implies, in particular, the proof of Fermat’s Last Theorem. 

Using vast empirical material on curves of the form y 2 = x 3 Tax and y 2 = x 3 +a 
and assuming that L(E , s ) can be continued to the whole complex plane, Birch and 
Swinnerton-Dyer came to the following conjecture. 

5.5.2. Conjecture 2 (Birch-Swinnerton-Dyer). Let E be a rational elliptic 
curve. Then its rank r^ is equal to the order of the zero of the function 

L{E,s) n i _ ap p-s _|_pi- 2 s 


at point s = 1 . 
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At the moment the Birch-Swinnerton-Dyer conjecture is only proved in certain 
particular cases. 

Now we pass to the torsion group Tors E(Q). To illustrate the possibilities that 
might occur, let us compute the torsion group for the family of curves y 2 — x 3 + ax. 
As before, by substituting x i—> q 2 x, y q 3 y we get for this family the equation 



This means that we may assume a to be a nonzero integer free of fourth powers. 

5.5.3. Theorem. Let E be an elliptic curve determined by the equation y 2 — 
x 3 + ax, where a is an integer free of fourth powers. Then 

{ Z/2 © Z/2 if a is a perfect square distinct from 4; 

Z/4 ifa = 4; 

Z/2 otherwise. 

Proof. In all cases the order of the point (0,0) is equal to 2 since an arbitrary 
point of order 2 is of the form (#, 0), where x is a root of the cubic equation 
x 3 + ax = 0. In particular, three points of order 2 exist if and only if —a is a perfect 
square. 

Now we consider the equation 2 (x,y) — (0,0). For such a point there exists 
a straight line L : y — Xx passing through (0,0) and tangent to the curve E at 
the point (x,y). Therefore, (A ®) 2 = x 3 + ax so that x(x 2 — X 2 x + a) — 0. Since 
L is the tangent to E at the point (x,y), it follows that the quadratic equation 
x 2 — X 2 x + a = 0 has a double root, i.e., its discriminant A 4 — 4a vanishes. Since a 
is free of fourth powers, the equation 

A 4 - 4a = 0 

has a rational solution if and only if a = 4. In this case the points (x, y) satisfying 
the equation 2(x, y) = (0,0) are of the form (2,4) and (2, —4). Therefore, the points 
of order 2 and 4 in E(Q) form, depending on a, subgroups of Z/2 © Z/2, Z/4 or 
Z/ 2 . So to prove the theorem, it suffices to show that there are no points of odd 
order in E( Q). 

We will carry out the proof of this fact by contradiction. Suppose that E(Q) 
has a point p ^ 0 for which 3 P — 0, i.e., 2 P — —P. Then the tangent y = Xx + (3 
to E at P if substituted into the equation 

y 2 — x 3 + ax 

should give a perfect cube, i.e., the equation 

(5.2) 0 = x 3 + ax — (Xx + (3) 2 = (x — r) 3 , 

where r is the first coordinate of P, should be satisfied. By removing parentheses 
in relation (5.2) we get 

0 = x 3 — X 2 x 2 + (a — 2(3X)x — (3 2 
= x 3 — 3 rx 2 + 3 r 2 x ~ r 3 , 
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from which 3r = A 2 , r 3 = (3 2 or, equivalently, ft 2 = Moreover, the third relation 
between the coefficients, 

3r 2 — a — 2/3X, 


leads to the equation 


— a — 2 


which is impossible for rational a and A. Therefore, Theorem 5.5.3 is proved. 


In a similar way we can prove that if E is the elliptic curve given by the equation 
y 2 = x 3 + ax, where a is a number free of sixth powers, then 

Z/ 6 Z if a = 1; 

Z/3Z if a is a perfect square distinct from 1 or a = —432; 
Z/2 if a is a perfect cube distinct from 1; 

0 otherwise. 

The structure of the group for the curves in the Weierstrass form y 2 = f(x) 
with integer coefficients is considerably clarified by the following statement. 

5.5.4. Theorem. Let E be the elliptic curve y 2 = f{x), where 

f(x) — x 3 + ax 2 + bx + c 

is a third degree polynomial with integer coefficients. If (x, y) is a point of finite 
order of E, then x,y € Z and y divides the discriminant A of f{x). 

This theorem implies the existence of an effective computation algorithm for 
calculating the group Tors A (Q) of E. Namely, consider the finite set of all divisors 
yo of the discriminant 

A — 18a 2 6 2 — 4a 3 c + 18a6c — 46 3 — 27c 2 , a,b,c £ Z, 
and find all integer solutions xo of the cubic equation 

x 3 + ax 2 + bx + c — yl = 0. 

Then all the nonzero points of finite order are contained among these. 

In 1976 the American mathematician B. Mazur proved a wonderful theorem 
on the structure of torsion groups of elliptic curves [C9]: 

5.5.5. Theorem. Let E be an elliptic curve defined overQ. Then Tors E(Q) 
is isomorphic to one of the following fifteen groups: Tj/rnL for m < 10 or m — 12, 
and TLjTL ® Z/2mZ for m < 4. 

In the following table we give examples of rational elliptic curves for which 
these torsion groups are realized. 
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Table 3. Examples of torsion groups of elliptic curves 


E 

TorsE(Q) 

y 2 = x 3 + 2 

0 

y 2 = x 3 + 8 

Z/2 

y 2 = x 3 + 4 

Z/3 

y 2 — x 3 + 4x 

Z/4 

y 2 -f y = X 3 — X 2 

Z/5 

y 2 = X 3 + 1 

Z /6 

y 2 — xy + 2y — x 3 + 2 a ; 2 

Z/7 

y 2 + 7 xy — 6y = x 3 — 6x 2 

Z /8 

y 2 + 3 xy + 6y — x 3 + 6 a ; 2 

Z/9 

y 2 — 7 xy — 3 Qy = x 3 — 18a; 2 

Z/10 

y 2 + 43 xy — 210 y ~ x 3 — 210a ; 2 

Z/12 

y = x° — x 

Z/2 © Z/2 

y 2 = x 3 + 5a ; 2 + 4a; 

Z/2 ©Z/4 

y 2 — 5xy — 6 y =• a : 3 — 3a ; 2 

Z/2® Z /6 

y 2 = a ; 3 + 337a ; 2 + 20736a; 

Z/2 ©Z /8 




CHAPTER 6 


Algebraic Equations 


The remaining part of the book is devoted to the solution of algebraic equations 
of fifth degree by means of theta functions. Originally this problem was solved by 
the famous French mathematician Charles Hermite in 1858. We divide the cor¬ 
responding material into two chapters. Chapter 6 is devoted to common properties 
of algebraic equations, Lagrange’s resolvents, Abel’s theorem on unsolvability in 
radicals of the general equation of fifth degree, and Bring’s form of the fifth degree 
equation. Regarding Lagrange’s resolvents and roots of unity we follow [B6]. In 
Chapter 7, following the famous book by Weber [B24], we describe the scheme of 
solution of an arbitrary fifth degree equation (given in Bring’s form) with the help 
of theta functions. 


§6.1. Solving cubic and quartic equations 


There are many known methods for solving third and fourth degree equations 
(also called cubic and quartic equations) in radicals. In this section we will only 
discuss the simplest of them. Certain other methods for solving cubic and quartic 
equations will be discussed in §6.3 and § 6 . 6 . 

Let us first observe that the equation x n + + ■ • ■ + a n = 0 can be 

reduced to the form y n + b 2 y n '~ 2 + • • • + b n — 0 with the help of the change of 
variables y — x + Therefore, it suffices to consider cubic equations of the form 
x 3 + ax + b = 0 and quartic equations of the form x 4 + ax 2 + bx + c = 0 . 

Cubic equations. Let us try to represent the roots of the equation 


( 1 . 1 ) x 3 + ax + b = 0 

in the form x = $/p + $/q. Then 

X 3 - p + q + 3 + tfq) = p + q + S^/pqx. 

Therefore, we should select p and q so that 


f S-^/pq — -a 
\ p + q^-b 


a“ 

pq = -Yl 

p + q — -b. 


Thus, we have derived a quadratic equation whose solutions are p and q. The roots 
of the quadratic equation are of the form 


(1.2) 



The formula x = + tfq gives 9 distinct possibilities. To obtain the right three 

values we should make use of the relation ^/p^/q = — |. 
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From this it follows that (1.1) has the solutions 


(1.3) 


*= i/p- 


a 

3^’ 


where p is found by formula (1.2). Here the value of x does not depend on the 
choice of the sign in front of the radical in (1.2). 

(It is easy to verify that the values of x determined by formula (1.3) are actually 
the roots of equation (1.1).) 


Quartic equations. First method. Let us try to represent the polynomial 
x 4 -f ax 2 + bx + c as the difference of two squares. To this end, let us use the 
identity 


x 4 + ax 2 + bx + c 


(z 2 +!+t) 


^2 tx 2 — bx + 


^t 2 + at — c + 



Select t for which the discriminant D = b 2 — 8 t (t 2 + at — c + j vanishes. Then 

x 4 + ax 2 + bx + c— (x 2 + ^ +1 j — 2 t ( x — — ) . 

Therefore, the equation 

(1.4) x 4 + ax 2 + bx + c = 0 

can be solved as follows. First, solve the cubic equation for t 


a* 


b A - St r + at - c + = 0. 


Let to be one of its roots. Then equation (1.4) can be expressed in the form 

, 2 + | +t „ = ± ^(^T). 

Second method. (Euler) Let X\, X 2 , X 3 , X 4 be the roots of equation (1.4). Set 
u — x 1 + £2 = - (#3 + X 4 ). Then 

x 4 + ax 2 + bx + c = (x 2 ~ ux + a) ( x 2 + ux + 0 ), 

i.e., 

a + (3 — u 2 ~ a, u [a — (3) = b, aj3 = c. 

From the first and the second of the above equations we get 

a = l{ a + u2+b u)’ ,3= 5( a + “ 2 "«)' 


Substituting these expressions into the third equation we get 
(1.5) u 6 + 2 au 4 + (a 2 — 4c) u 2 — b 2 = 0. 
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Equation (1.5) is a cubic equation for u 2 . First, we solve this cubic equation; then 
we find the 6 roots of equation (1.5). They are of the form ±u±, ±u 2 , dtu 3 . We 
may assume that 

X\+X 2 =Ui, X 3 +X4 = ~Ui, 

X X +X 3 =U 2 , X 2 +X4~ ~~U 2 , 

X1+X4— U 3 , X 2 +X 3 — ~U 3 . 

Then u x + u 2 + u 3 = 2x x . 

Third method. Instead of equation (1.4) we may solve the system of equations 


f / = y - x 2 = 0, 

\g = y 2 + ay+ bx + c — 0. 


The second equation of this system can be replaced by Xf + g — 0. The second 
degree curve Xf + g = 0 represents a pair of straight lines if and only if 

-A 0 
0 1 


(1.6) 


qT A 
2 


b 

2 

ctTA 

2 


0. 


If Ao is a root of the cubic equation (1.6), the equation Ao f + g 
two linear equations and, therefore, the system 

r / = o 

l A 0 / + g = 0 


0 factorizes into 


is easy to solve. 


§6.2. Symmetric polynomials 

A polynomial f(xi,...,x n ) is called symmetric if for any permutation 
(ii,..., in) we have f(x ^,..., Xi n ) = f(x i,...,x n ). The elementary symmetric 
polynomials ai(x i,..., x n ), defined by the relation 

(x - x\) •••{x — x n ) — x n - a 1 x n ~ 1 + a 2 x n ~ 2 -h (~l) n cr n , 

are the most important examples of symmetric polynomials. Thus if x \,..., x n are 

the roots of the polynomial x n + a\x n_1 H-1 -a n , then ai(x i,..., x n ) = (~1) ? a^. 

It is convenient to set cr^Xi ,..., x n ) = 0 for k > n. 

6.2.1. Theorem (Main theorem on symmetric polynomials). Any symmet¬ 
ric polynomial f(x \,..., x n ) can be represented in the form of a polynomial in 
elementary symmetric polynomials, i.e., f(x i,... ,x n ) — g{(7\, ■ ■ •, a n ), where g is 
a uniquely defined polynomial. 

Proof. It suffices to consider the case when / is a homogeneous polynomial. 
Let us order monomials lexicographically, i.e., say that ax™ 1 ■ ■ ■ x™ n is greater than 
bx f 1 • • • xfr if ai = Pi ,..., a k — (3 k , ak+i > Pk+i (k can be equal to 0). The great¬ 
est monomial of a polynomial is called its leading term. The following properties 
of lexicographic order are easy to verify: 

(1) The leading term of the product of two polynomials is equal to the product 
of their leading terms. 

(2) If ax™ 1 • • ■ x™ n is the leading term of a symmetric polynomial, then a.\ > 
a 2 > • • • > at n . 
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Let the leading term of a homogeneous symmetric polynomial / be equal to 
ax ® 1 • • • x® n . Let us consider the polynomial aa® x ~® 2 a ® 2 ~® 3 • • • a® n . By property 
( 1 ) its leading term is equal to 

ax^^ix® 2 -®^® 2 -® 3 ) ■ ■ • ( xf n ■■■x® n ) = ax ® 1 •••<"■ 

Therefore, the degree of the leading term of the polynomial 

fi = f — aa ® 1 -® 2 ^ 2- ® 3 • • • < 7 “” 

is lower than that of /. Let us apply to /i the same operation as to /, and so 
on. Since the number of monomials whose degree is lower than that of the leading 
term of / is finite, it follows that after finitely many operations we get the zero 
polynomial. This means that 

/ = aa® x ~® 2 a^~® z •■•<" + ••• = g(or i,... ,a n ). 

Now we prove the uniqueness of the representation / = g(ai, ■ ■ ■, cr n ). It suffices 
to verify that if g{a \,..., a n ) ^ 0 for any collection aq,..., x n , then g is the zero 
polynomial. Let g be a nonzero polynomial (over a field of zero characteristic). Then 
g(a\,..., a n ) ^ 0 for a collection {aq,..., a n }. It is also clear that if aq,..., x n are 
the roots of the polynomial 

x n - aix n ~ l + a 2 x n “ 2 -f (-l) n a n , 

then Gi(x\, ..., x n ) = a^. □ 

Remark. In Theorem 6.2.1, if the coefficients of / are integers, then the coef¬ 
ficients of g are also integers. 

The main theorem on symmetric polynomials plays an important role in the 
theory of algebraic equations for the following reason. Let x \,..., x n be the roots 

of the polynomial x n +aix n ~ 1 - \-a n . Suppose a polynomial f(xi ,... ,x n ) does 

not change under any permutations of its roots. Then it can be represented as a 
polynomial of the coefficients ai,... ,a n . We will often use this property. 


Problems 

6.2.1. Let Sk(x i,..., x n ) = x\ H-b x^. Prove that 

Sk - Sk-iGi + Sfc—202-b (-1)* ka k = 0. 

§6.3. The Lagrange resolvents 

We can solve the quadratic equation x 2 + ax + b — 0 as follows. Let X\ and X 2 
be the roots of this equation. Then 


^ ^ r _ 

(3.1) xi = - [(sq + x 2 ) + (»i - x 2 )] = - (xi + x 2 ) + V {xx - x 2 ) 2 

Here x\ + xq and (x\ — .x 2 ) 2 are symmetric functions of the roots and, therefore, 
they can be expressed in terms of the coefficients a and b. The explicit expressions 
are as follows: 

xi + x 2 — -a, (xi — X 2) 2 = (x\ + X 2) 2 — &x\X 2 = a 2 - 4b. 
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Thus, 



4 y/ a 2 — 4b 


The root \/a 2 - 46 has two values. One of them yields one root of the equation, 
the other value yields the other root, 

We can apply a similar approach to the solution of a cubic equation. Here, 
instead of square roots, we get cubic roots. Let a be a primitive cubic root of unity, 
i.e., a 3 — 1 and a ^ 1 . 

For the roots of the cubic equation x 3 4 ax 2 4 bx 4 c = 0 the analog of formula 
(3.1) looks as follows: 


x i — g [(#1 + x 2 + aq) + (aq 4 ax 2 4 a 2 xz) + (aq 4- a 2 x 2 4 0 : 2 : 3 )] 

(3-2) 

(xi 4 x 2 + £ 3 ) 4 yj(xi 4 02:2 4 a 2 xs) S 4 yj(x\ 4 a 2 i 2 4 02 : 3) 3 
To make use of this formula we have to calculate 



u — 


{■xi 4 02:2 4 a 2 x 3 ) 


3 


and 

v — (aq 4 oc 2 x 2 4 02 : 3) 3 . 

Under any permutation of the roots, u turns either into itself or into v. (To 
verify this, factor out a k from aq in the expression obtained after permutation 
of roots.) Thus, u 4 v and uv are symmetric polynomials of aq, aq and 2:3 and, 
therefore, they can be expressed in terms of the coefficients of the cubic equation. 
After these expressions are obtained, solving the cubic equation reduces to solving 
a quadratic equation. 

This method for solving a cubic equation was independently suggested by two 
French mathematicians, Lagrange and Vandermonde. They offered it simul¬ 
taneously in 1770. Vandermonde presented his work to the Paris Academy, while 
Lagrange, who worked during 1766-1787 in Berlin, reported his results to the Berlin 
Academy. 

Vandermonde’s work Memoire sur la resolution des equations contained a new 
approach to the solution of the third and fourth degree equations and also to certain 
other equations, in particular, the equation x 11 - 1 = 0 . Vandermonde’s paper 
was not published until 1774. During this time, in 1771 and 1773 two parts of 
Lagrange’s fundamental treatise Reflexions sur la resolution des equations were 
published. This treatise contained almost the same ideas as Vandermonde’s but 
these ideas were considerably more elaborate. Having acquainted himself with the 
works of Lagrange, Vandermonde never returned to this topic. 

On the base of the example above, introduce the following definition. Let 

x n 4 Gqoffi -1 H-b a n be a polynomial with rational coefficients, x 0 ,..., x n -i its 

roots. The Lagrange resolvents are expressions 

r(x 0 , a) = x 0 4 a 2 q -j-b cc n_ 1 2 : n _i, 

where a is the nth root of unity, i.e., a n = 1 . 
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The points a satisfying a n = 1 form the vertices of a regular n -gon and the 
points a k form the vertices of a regular m-gon, where m = nj (n, k). Therefore, 


Hence, 


E«* 


n k = 0 (mod n), 
0 k ^ 0 (mod n). 


nx o = ^r(x 0 ,a), 

(3.3) “ , 

nxk = y j ol r(x o,a). 

ot 

Thus, if the Lagrange resolvents are known, the roots of the equation can be cal¬ 
culated by formulas (3.3). 

Lagrange suggested a method for calculating resolvents that we will illustrate 
with the cubic equation. Let r = xq + ax\ + a z x 2 be a Lagrange resolvent for 
the cubic polynomial. Six possible permutations of the roots Xq, aq, x 2 give us six 
values ri,... ,tq. Let us consider the sixth degree polynomial 

g (t) = (t - n) {t - r 2 ) • • • {t - re). 

The coefficients of g are symmetric polynomials of r* and, therefore, they are sym¬ 
metric polynomials in Xq,Xi and X 3 ; hence, they can be expressed in terms of the 
coefficients of the initial equation. 

Lagrange called the equation g(t) = 0 the resolving equation. The point is that 
this equation is easy to be solved in radicals. Indeed, let n — xo + axi + a 2 .x 2 and 
r 4 = xq + ax 2 + o 2 .x'i. Then the remaining , r l are equal to 

r 2 — ocr 1, r 3 =^a 2 ri, r 5 = ar 4 , r 6 = a 2 r 4 . 


Hence, 

(t - ri)(t - r 2 )(t - r 3 ) = t 3 - rf, 

(t - r 4 )(t - r 5 )(t ~ r 6 ) = t 3 - r|, 

and, therefore, g(t) = (t 3 - rf) (t 3 - r|). Thus, the resolving equation g(t) = 0 is 
a quadratic equation for t 3 . 

Now let us find the resolving equation for the fourth degree polynomial. Here 
for the fourth degree root of unity it is convenient to take a — — 1 instead of the 

primitive root a = ±i. Then the resolvent r = Xq — aq + x 2 — x 3 takes, under 

permutations, only 24/4 = 6 rather than 4! = 24 distinct values and each value is 
taken exactly 4 times. These values are as follows: 

n = (® 0 + aq) - (®2 + *3) = ^4 5 

r 2 ~ («o + x 2 ) - (aq + x 3 ) = -r 5 , 

r 3 = (xq + * 3 ) - (*1 + x 2 ) = -r 6 . 

Thus, p(t) = h 4 (i), where h(t) = (t 2 - rf) (t 2 -r|) (i 2 -r§). Hence, the equation 
g(t) = 0 is equivalent to the equation h(t) = 0, which is a cubic equation for t 3 . 

There is no necessity to give the calculations of the resolvent for the primitive 
roots a = dbi. Indeed, 

(x 0 +xi+x 2 + x 3 ) + n + r 2 + r 3 = 4x 0 . 
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For the 5th degree polynomial the resolving equation is of degree 5! = 120. Let 

r = xq + olx i + c?x 2 + a 3 x 3 + a 4 X 4 , 

where a 5 = 1. If r\ is one of the values of r, then ar\, o?r\, a 3 ri and a 4 ri are also 
values of r. Therefore, the resolving polynomial g(t) can be factorized and 

h(t) = (t — ri) ( t — ari ) • ■ • (t — arVi) = t 5 — rf 

is one of the factors. It is easy to see that g(t) is the product of 24 factors of this 
form. Hence, g(t ) is a polynomial of degree 24 in i 5 . 

In spite of considerable efforts, Lagrange did not succeed in solving the equation 
g(t) = 0. This failure convinced him that it is impossible to solve a generic quintic 
equation in radicals. 

Nevertheless, for the equations that can be solved in radicals Lagrange’s method 
of resolvents turned out to be an important method for their solution. In the next 
section we will use it to solve in radicals the equation x n — 1 = 0 . 

§6.4. Roots of unity 

The equation x n — 1 = 0 admits an obvious solution in radicals: x — \/T. This 
solution is, however, unsatisfactory for the following reason. Dividing x n - 1 by 
x — 1 we get the polynomial 

fn(x)=X n ~ 1 +X n ~ 2 -\ - \-X + l. 

The degree of f n is equal to n — 1 and, therefore, the formula x = y/l provides us 
with n roots, one root more than we would like to have. We would like to get a 
formula for the roots of f n which does not involve roots of degree higher than n — 1 . 
This will be obtained in this section. 

First, let us consider several examples. The formulas for the roots of the poly¬ 
nomials (x) = x 2 + x + 1 and f${x) — (x + 1 ) ( x 2 + l) are elementary to obtain. 
So let us immediately pass to the polynomial fs(x) = x 4 + x 3 + x 2 + x + 1 . Set 
y = x+x -1 . Then the equation f-,(x) — 0 is equivalent to the equation y 2 +y—l = 0. 
The roots of the latter equation are 

-1 ± 

2 / 1,2 = -— 2 ~~’ 

and the roots of equations yi — x + x" 1 are 

Vb - 1 ± V-2V5 - 10 -a/5-1 ± V2V5- 10 

4 4 

The number a is called a primitive nth root (or a root of degree n) of unity if 
ot n = 1 and a k 7 ^ 1 for k = 1,2,..., n — 1. In this case the numbers a, a 2 ,..., a 71 ” 1 
are distinct. It is easy to verify that if a and j3 are, respectively, pth and qt\i 
primitive roots of unity, where p and q are relatively prime, then a(5 is a primitive 
pqih root of unity. Therefore, it suffices to consider equations f m {x) = 0, where m 
is a power of a prime. 

For the equation fj{x) = 0 we can, as before, make the change of variables 
y = x + x~ l . As a result we get the equation 

y 3 + y 2 ~ 2y — 1 — 0. 
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Its roots are given by the formula 


y 1,2,3 



J -7 + 21 v /=I 3 

V 6 + 



21>/—3 


Now the roots of the equation f-j{x) = 0 are obtained as solutions of the quadratic 
equations x 2 — piX + 1 = 0. 

For the polynomial / 11 (a) the change of variables y — x + x~ l does not give 
the desired result since this change of variables leads to a quintic equation, which 
is unclear how to solve. Lagrange’s method of resolvents enables one to overcome 
these difficulties. The numbers a, a 2 , ..., ct 10 , where a is one of the primitive 11th 
roots of unity, are the roots of equation 

(4.1) fn ( x ) = a 10 + a 9 + • • • + x + 1 =0. 


The degree of the polynomial fu is equal to 10, hence, to construct its Lagrange 
resolvents, we need a 10th root of unity. Let (3 be a primitive 10th root of unity. 
Let us order the roots of equation (4.1) so that each next root is the square of the 
preceding one: 


(4.2) 


a, 


a 10 , a 9 , a 7 , a 3 , a e 


Consider the Lagrange resolvent constructed from this sequence of roots: 

r*i = r(a, /?) = a + (3a 2 + (3 2 a 4 + /? 3 a 8 H-+ (3 8 a 3 + (3 9 a 6 . 

Under the change of the primitive root (3 with the root /+, where i = 1,..., 10, we 
get the Lagrange resolvents rj = r(a,(3 l ), where i — 1,..., 10. It is easy to verify 
that 

ri + r 2 H-1- no = 10a. 

Therefore, the 11th root of unity, a, will be found if we succeed in calculating 
r t ,...,r w . 

First, we calculate the r|°. Since a 11 = 1 and (3 10 = 1, it follows that ry can 
be represented in the form aijkP^a k , where 0 < / < 9, 0 < k < 10, and r}° 
can be represented in the form )>+ k bijkf3^a k , where 0 < j < 9, 0 < k < 10. 

Here and bijk are nonnegative integers. Let us express r}° as a polynomial 
of a, where the degrees of a are ordered as in (4.2): 

(4.3) rj° = y ifi {(3) + Pi,i(P)a + p i>2 ((3)a 2 + p i)3 (/?)a 4 + • • • 3r Pi,io{(3)a & . 

Let us prove that rl° does not depend on a, i.e., rj° = Pi{(3 ), where Pi is a 
polynomial with integer coefficients. It is easy to verify that r(a 2 ,(3' 1 ) = /3“V(a, / 3 l ), 
i.e., ri(a 2 ) = j3~ i r i (a). Hence, r}°(a 2 ) — rj°(a). Thus, denoting for brevity p^k {(3) 
in formula (4.3) by pk we get 

Po + pi a + p 2 a 2 + p 3 a 4 + p 4 a 8 H-h pga 3 + pioa 6 

= Po + Pi a 2 + p 2 Ct 4 + P3 a 8 + P4& 5 H-1- Pqo? + Pioa- 


Therefore, 

(4.4) (pio - Pi) a + (pi - p 2 ) a 2 + (P 2 - Ps) a 4 3 -+ (pg ~ P 10 ) « 6 = 0. 
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The following statement holds. 


6.4.1. Theorem. Let p be a prime, and let a and /3 be primitive roots of unity 
whose degrees are p and p — 1, respectively. Then the equation 

qi(P)a + q 2 (f 3 )a 2 +-1- q p -i(P)a p ~ 1 = 0, 

where qi,...,q p _ x are polynomials with integer coefficients, implies that q\((3) = 
92(/?) = • • • = q p -i(P) ~ 0. 

Although this statement (there are no nontrivial relations between a and (3, 
something like linear independence) seems almost obvious, its proof is rather com¬ 
plicated. It requires elements of Galois theory and, therefore, we will omit it, see 
[B6], p.97. 

Applying Theorem 6.4.1 for p = 11 to equation (4.4), we get p 10 — p 1 — p 2 = 
P 3 = ■ ■ ■ = pq. Therefore, 

r i° ~ Po(P) + pi({3) (a + a 2 +a 3 +-f a 10 ) = po{/3) — Pi(/3). 

Let us summarize. It is possible to represent a primitive 10th root of unity in 
the form of the product of primitive roots of degrees 5 and 2. Therefore, j3 can 
be expressed with the help of quadratic radicals. Next, we can represent r}° as a 
polynomial in (3 with integer coefficients. It remains to use the formula 



This formula gives 100 values instead of 10, as we would have liked. We have to 
select the 10 values needed among all the values obtained. Therefore, the formula 


(4.5) 


a = 


1 

10 


( r l + 1 * ‘ + Fio) 


1 

To 



rzrf 

<v*8 

'1 



is more convenient. The point is that the values r,;rj° *, as well as r}°, do not 
depend on a. To prove this, it suffices to observe that 


n{cx 2 )r \ 0 l (a 2 ) — (3 Vi (a) (p 1 r 1 {a)) 1 ° * = ri(a)r} 0_i (a). 

Therefore, formula (4.5) determines a uniquely after any one of 10 values of ri = 
\/rF is chosen. 

We can similarly solve the equation f p (x) = 0 for any prime p > 2 as well. 
Resolving the equation fn(x) = 0 we made use of the fact that the primitive 
11th roots of unity can be ordered so that each next root is the square of the 
preceding one. Such an ordering of roots is possible because the numbers 2 k , where 
k = 0,1,... ,9, have distinct residues modulo 11. A similar statement for residues 
modulo, for example, 7 is false but for any prime p there exists a number g, called 
a primitive element , such that the residues of the numbers g k , where k — 0,1,..., 
P ~ 2, modulo p are distinct. In other words, the multiplicative group of nonzero 
residues modulo p is the cyclic group generated by g. We will not prove this well- 
known statement. 

Let a and (3 be primitive roots of unity of degrees p and p — 1, respectively. If 
the residues modulo p of the numbers g k for k = 0,1,... ,p - 2 are distinct, then 
the numbers 

a , a 9 , a 9 ,..., a gP 
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exhaust all primitive pth roots of unity. Therefore, for a polynomial f p (x) we can 
consider the Lagrange resolvent 

ri = a + (3a 9 + (3 2 a 9 + ---+/3 p ~ 1 a 9 


Set Ti = ri(a,P l ). It is easy to verify that ri(a 9 ,(3 l ) = f3 l ). Hence, the 

quantities rf “ 1 and do not vary when a is replaced with a 9 . As in the case 

p = H 5 we can prove with the help of this property and Theorem 6.4.1 that r p 
and nrf -1-4 are polynomials in (3 with integer coefficients. For /3 an expression 
in radicals can be obtained by the induction because (3 is a root of unity of degree 
p — 1 < p. The formula for a is as follows: 


a — 


1 

p — 1 



P- 

r 2 rj_ 
v- 


+ 


p —4 

„p ~ 4 



This formula gives an unambiguous expression for a after one of the values of 


r i = 



is chosen. 


§6.5. The Abel theorem on the unsolvability 
in radicals of the general quintic equation 

Lagrange’s works spurred many geometers (this was the common name for all 
mathematicians of that time) to begin searching for a proof of the impossibility to 
solve in radicals the general quintic equation and higher degree equations. In 1788- 
1813 there appeared several papers of the Italian mathematician Paulo Ruffini 
(1765-1822). Following Lagrange, he considered substitutions of roots of equations 
and it was he who coined the term the group of substitutions. His series of papers 
culminated in the proof of the theorem on impossibility to solve in radicals the 
general equations of the fifth and higher degrees. 

Regrettably, this proof had an essential gap. Without justification Ruffini as¬ 
sumed that the radicals can be rationally expressed in terms of the roots of the 
initial equation (cf. Theorem 6.5.4 below). 

The Norwegian mathematical genius Niels Henrik Abel (1802-1829) was the 
first to give a complete proof of the theorem on unsolvability of the general quintic 
equation. He exposed his proof in the memoir Proof on Impossibility of an Algebraic 
Solution of General Fifth Degree Equations published in the first issue of Crelle’s 
journal in 1826. 

We say that the equation 

(5.1) F(x ) = x n + ciic n_1 4-1- c n = 0 

is the general nth degree equation if its coefficients ci,..., c n are independent vari¬ 
ables over the ground field L. In what follows we will assume that L = Q. 

Adjoining c\ ,...,c n to Q, we get the field A = Q(ci,..., c n ). This field is 
called the rationality field of equation (5.1). 

Having attached to A the roots au,..., a n of equation (5.1) we get the field 
A (F) = A(ai,..., a n ), called the normal field of equation (5.1) or the Galois field 
of this equation. 

We will say that equation (5.1) is solvable in radicals if A (F) is contained in 
the extension R of A obtained after attaching to A certain radicals 
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where 

®1 ^ ^2 ^ ^(Pl)> € A(pj , P 2 ) j • • ■ 1 G A(pj , . . . , p m _i) . 

Example. Let F(x) = ai 2 +cia;+c 2 . Then A = Q( Cl ,c 2 ) and A(F) = A( v /oT), 
where ai = cf — 4c 2 e A. 

Observe that the exponents p,q,...,s of the radicals pi, p 2 ,..., p m can be as¬ 
sumed to be primes. Indeed, if p = Irn, then instead of adjoining the radical 
Pi = tfa{ we may consecutively adjoin the radicals p = f/af and pi — ^/p. 
Therefore, in what follows we will only consider adjoining the radicals with prime 
exponents. 

Suppose that equation (5.1) is solvable in radicals. Adjoin to A the primitive 
roots of unity e \,..., e m whose degrees are equal to the degrees of the radicals 
Pi) • • •> Pm, respectively. Denote the obtained field by K. 

Since A C K, it follows that 

A(F) c A(pi,...,p m ) c K(pi ,..., p m ). 

To prove Abel’s theorem, we will need four auxiliary statements, Theorems 6.5.1- 
6.5.4. 

6.5.1. Theorem. Let p be a prime and k afield of zero characteristic. The 
polynomial x p — a is reducible over k if and only if a = b p for some b G k. 

Proof. Suppose x p — a — f (x)g(x), where / (x) and g(x) are polynomials over 
k. Let £ be a primitive pth root of unity and ,8 = f/a. Then 

f(x) = x r + cix r ~ 1 -j-1- c r = (x - s ni /3) ■■■(x- s nr (3). 

Hence, ±e l j3 r = c r € k, where l = m + • • • + n r . Since {s l ) p = 1, it follows that 
(±(3 r ) p = (c r ) p , i.e., a r — (±c r ) p . The number p is prime and 1 < r = deg / < p; 
hence, rs +pt — 1 for certain integers s and t. Therefore, a — a rs a pt — ( ±c r a t ) p = 
b p , where b = ±c r af G k. 

It is also clear that if a = b p , then x p — a is reducible because it is divisible by 
x-b. □ 

6.5.2. Theorem. Let s be a prime and a* e k — K(pi ,... If pi = 

v^i f k, then p\ k if and only if l is divisible by s. 

PROOF. If l = ns, then p\ = af € k since € k. Now suppose that p\ = a £ k 
and l — sq + r, where 0 < r < s. Then a = p\ — (ai) q p\ and, therefore, p\ — b, 
where b = a (ai)~ q e k. 

Over k, the polynomials X s — ai and x r ~ b have a common root pp hence, 
they have a common divisor whose degree does not exceed r < s. In particular, the 
polynomial x s - ai is reducible over k. Theorem 6.5.1 implies that ai = b s , where 
b € k. Clearly, b — epi, where e is a primitive root of unity of degree s. Since 
£ E K C k, it follows that pi G k. Contradiction. □ 

We may assume that Pi,..., p m is a minimal sequence of radicals (of prime 
degrees) required to compute a root a of equation (5.1), i.e., any other such sequence 
contains at least m radicals. In what follows we will only consider minimal sequences 
of radicals. Under this assumption the following statement holds. 
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6.5.3. Theorem. Let pi,...,p m be a minimal sequence of radicals needed to 
compute a root a of equation (5.1). Then a can be represented in the form 

a = u 0 + p + U 2 p 2 H-h u s ~ip s ~ l , 

where s is the degree of p m , p = f/a, a £ k = K(p \,..., pm- 1 ) an d u i € k. 

Proof. Since a £ K(pi ,... ,p m ) = fc(p m ) and p s m £ k, we have 

(5.2) a = 6o + b\p m + +-H 

where <E fc. The only difficulty is to ensure that bi = 1. By the assumption, 
a f k so that at least one of the numbers &i,..., 6 s -i Is nonzero. Let 5/^0 for 
some i such that 1 <1 < s. Set p - bip l m . Since s is a prime, ul + vs — 1 for certain 
integers u and v. Moreover, we have 

P —t) l p m —O l p m — O t a pm, 

i.e., p m - cp u , where c = bf u a v £ k. Since p m $ k, it follows that p £ k. It is also 
clear that p s — bfp l m = bf a 1 £ k. 

In (5.2) replace p m with cp u taking into account that bip l m — p. As a result, 
we get 

(5.3) a ~ bo + hcp u + b 2 c 2 p 2u + ••• + /? + ••• + b s ^c s ~ l p^ s ^ u . 

Theorem 6.5.2 implies that p* £ k if and only if t is divisible by s. Since u and s are 
relatively prime, the elements 1, p u ,p 2u ,are linearly independent over 
k and the set of these elements coincides with the set 1 ,p,p 2 ,..., p s ~ l (perhaps, 
ordered differently). Thus, formula (5.3) gives the required expression for a: 

a = bo + p + b' 2 p 2 + • - + K_ 1 p s 1 - □ 

6.5.4. Theorem. The minimal sequence of radicals pi,...,p m necessary to 
calculate a root a of polynomial (5.1) can be selected so that p \,..., p m ore polyno¬ 
mials over K of the roots a \,..., a n of polynomial (5.1). 

PROOF. Start with an arbitrary minimal sequence pi,...,p m . By Theorem 
6.5.3 we can replace p m with a radical p of the same degree s so that 

a — uq + p + u\p 2 + • • • + u s -1 p s 1 , 

where m £ k = K{p u ... ,p m -i) and p s = a £ k. Let us show that for any root f 
of the polynomial X s — a 

o:(£) = uo + £ + u^ 2 4— • + u s -i£ s 1 

is a root of polynomial (5.1). Substitute x = ct(^) in the polynomial 

F(x) = X n + C\X Tl ~ 1 H-1- c n . 

Taking into account that — a £k we get an expression of the form 

bo + + • ■ • + b s -i£ s 1 , 

where bi £ k. The polynomials X s — a and bo + biX-\ -b b s -ix s ~ l have a common 

root p; hence, they have a common divisor over k. By Theorem 6.5.1 the polynomial 
X s — a is irreducible over k ; hence, bo = b\ = • • ■ = b s -1 = 0. This means that if ^ 
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is a root of the polynomial x s — a, then a(£) is a root of polynomial (5.1). Let e 
be a primitive root of unity of degree s. Then £ = e r p\ hence, 

OLr +1 — Uq + £ r p + U2£^ r p 2 + • • • 4" U s —W p s 1 

for r = 0,1,..., s — 1 are roots of polynomial (5.1). 

For example, for 5 = 3 we get 

ot i = Uq + P + U2P 2 , 
a 2 = w 0 + ep + U 2 £ 2 p 2 , 

Oi3 = U 0 + S 2 p + U 2 £p 2 . 

Since 1 + e + e 2 = 0, we have 


Oi\ + «2 + = 3^0, 

ai + e~ x OL2 + £~ 2 Oi3 = 3 p, 

OL\ + £ 2 Oi2 + £ ^ 0:3 = 3ll2p 2 ■ 

Therefore, p = ~ (01 + £ 2 ct 2 + ea 3 ). For s > 3 we get more cumbersome formulas 
but the arguments remain the same. The proof of the theorem for the last radical 
p m is completed. 

Let us now turn to p m - 1 - We have shown above (for s — 3) that the expressions 
uq,p, U 2 P 2 ,..., w s _ip s_1 can be polynomially expressed in terms of roots ai,...,a„ 
of polynomial (5.1). Moreover, they lie in the field K(pi,, p m - 1 ), so that each 
of the values indicated can be represented in the form 

^0 + VtPm-l + V 2 pl l _ 1 + ' ‘ 

where Vi E K(pi ,..., p m _ 2 ). The sequence of radicals pi, ■ ■ ■, p m is minimal, so 
that the equations vi = V 2 = • • • = v t ~\ — 0 cannot be simultaneously satisfied for 
all the quantities because otherwise we could have excluded p m -\• Therefore, there 
exists a relation of the form 

^0 + Vipm-l + H-1- Ut -1 Pm-l = • ■ • j <*„), 

where Vi € K(pi,..., p m - 2 )) not all elements v \,..., v t ~i vanish and r(ai,..., a n ) 
is a polynomial over K. Consider the polynomial 

(jr (t) — (t r(a(j(i) , • • • , Q^a(n) )) ) 

where the product runs over all the permutations a E S n . The coefficients of G are 
the symmetric polynomials of the roots of polynomial (5.1), so that they can be 
polynomially expressed in terms of the coefficients of polynomial (5.1). Thus, G is 
a polynomial over K and 

ft = Vq + Vipm—i + ' ' • + 

is a root of this polynomial. It is also clear that the root (3 can be expressed 
by means of the radicals (with the help of the sequence of radicals pi,... ,p m ~ 1 ). 
By Theorem 6.5.3 replacing p m _i with the radical p' of the same degree we may 
assume that v\ — 1. We can now apply to p' the same arguments as we applied 
to p. Iterating the arguments for Pm- 2 , and so on, down to pi completes the 
proof. □ 
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Now we can pass to the proof of Abel’s theorem proper. 

6.5.5. Theorem (Abel). For n> 5 it is impossible to express the roots of the 
general nth degree polynomial in radicals. 

Proof. Suppose that a certain root ol\ of the general nth degree polynomial 

x n + c\x n '~ 1 + C 2 X n ~ 2 + • ■ • + c n 

can be expressed in radicals. Then by Theorems 6.5.1-6.5.4 there exists an expres¬ 
sion of in radicals of the following particular form. The root a.\ is obtained by 
consecutively adjoining the radicals pi,..., p m of prime degrees to the ground field, 
and these radicals, in their turn, are polynomials in the roots ot \,..., a n of the initial 
polynomial. More precisely, let £i,..., e m be primitive roots of unity whose degrees 
are equal to the degrees of the radicals pi,..., p m , respectively, A = Q(ci,..., c n ), 
and 

K — A(si, . . . , Em) — Q(S-1 ? • • • ? , Ci,. . . , C n ). 

Then ai can be polynomially expressed over K in terms of pi,..., p m , i.e., 

OL\ = r(pi, • ■ • , pm? Ci,. . . , C n ), 

where r is a polynomial over Q(ei,..., £ m )- l n their turn, pi,..., p m can be poly¬ 
nomially expressed over K in terms of or,..., a n , i.e., 

pi ~ r^ (ctl, . . . , OCni Cl, • • • , C n ), 

where r. L is a polynomial over Q(ei,... ,£ m ). Since we deal with the general poly¬ 
nomial of degree n, we may assume that a\,... ,a n are independent variables and 
d,..., Cn are (up to a sign) the elementary symmetric polynomials oi a\,... ,a n . 

Let us show that for n > 5 the assumption on solvability in radicals of the 
general algebraic equation of degree n leads to a contradiction. To this end consider 
the permutation 

_ /123456... n\ 

T ~ \234516 ...n) 

that cyclically permutes the first 5 elements, the others being fixed. Let us prove 
that under the action of T on the roots oti,...,a n the first radical pi does not 
change. Since 

pi =ri(ai,...,a n ,ci,...,c n ) - 

where oq is a polynomial of Ci,...,c n over the field Q(ei,... ,£ m ), the equation 
p v x = ai can be considered as a relation of the form 

ip{(Xly . . . , Oin? Cl, . . . , C n ) —- 0, 

where <p is a polynomial over Q(ei, ■ • • , Em) ■ 

Let us show that any relation of this form is preserved under any permutation of 
the roots ai,..., a n . Let (3\ = ,..., /3 n = a:j n , where i \,... ,i n is a permutation 

of the numbers 1,2,... ,n. Then 

<p(/3i, • • • ? /3n? di,..., d n ) — 0, 

where di = Ci((3 1 ,... ,/5 n ). Clearly, di = Ci{oc \,.. .,Oi n ) = c 7 ; because the functions 
C{ are symmetric. Hence, 


, • • • ? Eii n , C \, . . . , C n ) 0 . 
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Thus, the relation p\ — a-{ is preserved under the action of T on the roots 
i.e., T{p \) = T(a\). Clearly, T(pf) = T(pi) p . Since ai only depends 
on the symmetric functions of roots, T(ai) = a\. Therefore, T(pi) — e A pi and 
T m (pi) — £i lX pi. But T 5 = I is the identity substitution, hence, ef A pi = T 5 (pi) = 
Pi, i-e., ef A = 1. 

Let us now turn to the substitutions 

/123456 ...n\ /123456 ...n\ 

~ V 124 536...nJ’ “ \231456...n/ 

It is easy to verify that U 3 — V 3 — 1; hence, U(pi) = dj'pi and V{pi) = e£p 1} and 
e 3 / = = L Moreover, UV = T; hence, 

T(p l ) = VU(p 1 )=ef'p 1 . 

Hence, = e^ +L/ so that e, = e® A ef 5A = = 1 because e, A = = 1. 

As a result we get T(pi) = p\. 

Passing consecutively to the radicals p %,... ,p m we similarly get T(pi ) = pj for 
i = 2,... ,m. 

Since pi —ri(a i,..., a n , Ci, ..., c n ), it follows that the equation 

Qt\ ^(pl j • • • ) Pmi Cl, • • • , Cn) 

can be considered as a relation between a\,... ,a n ,ci,... ,c n over Q(si,..., e m ). 
This relation is preserved under the action of T, i.e., 

T{ ai ) = r(T(pi), ..., T(c n )) = r(p l5 ..., c n ) 

since T(ci ) = c* and T(pi) = pi. Therefore, T(a i) = a\. On the other hand, by 
the definition of T we get T(a i) = a 2 ] hence, u\ = a 2 - The relation ai = a 2 
contradicts the independence of the roots of the general equation. □ 


§6.6. The Tschirnhaus transformations. 

Quintic equations in Bring’s form 

In 1683 in the journal Acta Eruditorum E.W. von Tschirnhaus 1 (1651-1708) 
published a method for transformation of algebraic equations which, Tschirnhaus 
believed, enabled one to solve in radicals the equation of any degree. Leibniz 
immediately announced that Tschirnhaus’ claim on the universality of this trans¬ 
formation was not valid. The catch is that in order to solve a quintic equation with 
the help of the Tschirnhaus transformations one has to solve an equation of degree 
24. 

Still, the Tschirnhaus transformation has important applications. For example, 
with its help any quintic equation without multiple roots can be reduced to the form 
y 5 + by — a and in the process we only have to solve equations of degrees 2 and 3. 
In Chapter 7 we will show that equations of such a form can then be solved using 
theta functions. 


Mhe mathematicians often write Tschirnhausen, but as is clear from the works of historians of 
mathematics, the correct spelling is Tschirnhaus. (Regrettably, his original works were inaccessible 
for us.) The authors. 
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The Tschirnhaus transformation of the equation x n + Cix n ~ l +-1- c n = 0 is 

as follows. Let x 1} ... ,x n be the roots of this equation. Let us consider a rational 
function p that is finite at the points aq, ..., x n . 

Set yi — p{xf) and let 

y n + qiy n ~ 1 + --- + q n = 0 

be the equation for which y\,. .., y n are the roots. Further on, we will show that 
if this equation has no multiple roots, then the aq can be expressed in terms of the 
yi. By selecting an appropriate p we may assure that the coefficients qi,, q n -i 
become zeros. But to do this we have to solve an equation of degree (n — 1)! and 
this was precisely the objection of Leibniz. 

Without loss of generality we can use a polynomial of degree not higher than 
n — 1 instead of the rational function ip. We can do this because of the following 
statement. 

6.6.1. Theorem. Let xi,...,x n be the roots of a polynomial f whose degree 
is equal to n and p = P/Q, where P and Q are polynomials such that Q(xi) ^ 0 
for i = 1,..., n. Then there exists a polynomial g of degree not higher than n — 1 
for which the values of g at points aq,... ,x n coincide with the values of p at these 
points. 

Proof. By the assumption the polynomials / and Q have no common roots, 
hence are relatively prime. Therefore there exist polynomials a and b for which 
af + bQ = 1. Since /(aq) = 0, we have 6(aq) = 1 /Q(aq). Hence, 

p(xi) = P(xi)/Q(xi) = P(xi)b(xi). 

Thus, for g we can take the residue of the division of Pb by /. □ 

In the sequel we will assume that to the equation 

f{x) — x n + CiT 71 " 1 H-f- c n = 0 


the transformation 

y - g(x) — Po + PlX + ■ • • + Pn-lX n ~ l 

is applied. Let us show how to calculate in this case the coefficients of the polyno¬ 
mial y n + (qy n ~ 1 + • • • + q n whose roots are yi = g(xi),i — 1 ,..., n. For simplicity, 
we confine ourselves to the case n — 3 . 

If x 3 — —cix 2 — C2X — C3, then 

yx = pox + pxx 2 +p 2 (-cia; 2 - c 2 x - c 3 ) = p ' 0 +p' 1 x + p , 2 x 2 . 

Similarly, yx 2 — pQ + pfx + p 2 x 2 , where the p'( are linear functions of the 
parameters pi. Therefore, if aq is a root of / and y lt = g(xi), then the system of 
equations 

{ (Po ~ y)zo +P1Z1 +P2Z2 = 0 , 

PqZq + (pi - y)z x + p' 2 z 2 = 0 , 

PqZo + PlZl + (P2 - y)z 2 - 0 
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has a nonzero solution (zo,Zi,z 2 ) = (1 ,Xi,xf). Set 


Po 

Pi 

Pj\ 

Po 

p'l 

to 

Po 

Pi 

p'i 


Then det (A-yl) — 0 for yi — g(xi). If the polynomial det (yI~A) has no multiple 
roots, it coincides with the desired polynomial y n + q\y n ~ l + • • ■ + q n . Since the 
elements of A linearly depend on the parameters pi , the coefficient is a kt\\ degree 
polynomial of the parameters pi. 

If y n + qiy n ~ l H-1 -q n has no multiple roots, A has no multiple eigenvalues. 

Therefore, to each eigenvalue of A there corresponds a unique (up to proportion¬ 
ality) solution of system (6.1). This means that from a root yi of the transformed 
polynomial a root Xi of the initial polynomial is uniquely recovered and that the 
root Xi can be rationally expressed in terms of the root yi- 

The Tschirnhaus transformation enables one to solve in radicals the equations 
of degrees 3 and 4. A cubic equation can be reduced to the form y 3 + q 3 — 0 after 
we solve the system depending on the parameters Po,Pi,P 2 consisting of a linear 
equation q\ — 0 and a second degree equation q 2 — 0. To do this, we have to solve 
a quadratic equation. 

A fourth degree equation can be reduced to the form 

y 4 + Q 2 y 2 + 44 = o. 

To do so, we have to solve a system consisting of a linear equation q\ = 0 and a 
third degree equation q 3 — 0, which reduces to solving a cubic equation. 

A fifth degree equation can be reduced to the form 

y 5 + q4V + 95 - o 

after the system of equations qi — q 2 — g 3 = 0 is solved. To this end we have to 
solve an equation of degree 6. A more detailed analysis was performed in 1789 by 
the Swedish lawyer, historian and mathematician Bring, who demonstrated that in 
this case instead of a sixth degree equation it suffices to solve equations of degrees 2 
and 3. Indeed, in order to satisfy the equation q\ = 0, express one of the parameters 
Po, • ■ ■ iPi as a linear function of the other parameters. Then the coefficient q 2 is a 
quadratic form with respect to four of the parameters Pi. This quadratic form can 
be reduced to the form u\ + u? 2 - v\ - , where Uj and vj are linear functions of 

the Pi (to perform the reduction, we have to calculate square roots). To satisfy the 
equation q 2 = 0, it suffices to solve the system of linear equations Ui = v\ , u 2 = v 2 . 

This leaves us two parameters and the equation q 3 — 0 for them is a third 
degree equation. As a result, we get an equation of the form y 5 + q 4 y + q 5 — 0. 

If (?4 yk 0, then with the help of a linear change of variables we can reduce this 
equation to the form y 5 + 5y = a. 




CHAPTER 7 


Theta Functions and 
Solutions of Quintic Equations 


§7.1. Definition of theta functions 

Theta functions are entire functions with one genuine period and one quasiperi¬ 
od (though under addition of the quasiperiod to the argument the function changes, 
but this change is subject to a sufficiently simple law). 

Setting q = e i7rT one can map the upper half plane H — (r G C | Imr > 0} in 
the interior of the unit disk D — {q € C j |g| < 1}. Indeed, let r = x + iy , where 
x,y € M. Then q = e l ' KX ~' Ky — e~ n v e ™x and \q\ — e _7ry . Hence, \q\ < 1 if and only 
if y > 0, i.e., Imr > 0. 

Fix a number r G H and consider the series 

OO OO 

0 3 (u|r)= e( m2T+2mv ) ni = 

m=—oo m=—oo 


E 


m 2 2irimv 
q e 


The absolute value of the ratio of the consecutive terms of this series is equal to 
|^2m+i e 27nw| < |g| 2m+1 e 2 ^ v K Since limra-^ ]g | 2m+1 = 0, it follows that @ 3(0 | r) 
is a series of entire functions of v converging uniformly in the domain |?;| < c, where 
c is a constant. Therefore, ©3(1; | r) is itself an entire function of v. For brevity we 
will often denote this function by © 3 ( 1 ;). 

By replacing v with v + 1 we get the same series, since e 2lT ‘ l ‘ m ( v + 1 ) — e 27r ‘ irnv , 
This means that @ 3(11 + 1 ) = © 3 ( 1 '). Moreover, & 3 (v + r) = A@ 3 (u), where 
A = q~ 1 e~ 2mv . Indeed, 

0 3 + r ) = J2 q m2 e 2niTnv q 2rn 

= q -l e - 2 ™^2 q (m+l) 2 e 2ni(m+l) _ g -l & -2niv 


It is also convenient to consider the functions 


©o(v) = ©3 (v + ^ O' 

©i(v) =ie-™( v -*)e 3 (v + 


„2itimv , 


1 — r' 


■im _ 'y ^ ^_l) m qin 2 ^2Trimv. 


%e 


e“ ."“e—e 


m g2irimv „rrim ^—iximr 

i (-l) m q ( m -iY 
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(the function q x is multivalued for A ^ Z; we take q x = e t7rXr ); and 

02(v) = e -^(«-i)0 3 (v-Vj = e -"i( v -t) Y,q m2 e 2 ” imv e-" imv 

— y^^ q {rn-\f e -ni{2m-l) v _ 

The functions ©o, ©i, O 2 and ©3 are called theta functions. 


Problems 

7.1.1. Prove that Qk{v + 1) = ©fc(u) for k — 0,3 and ©fc(u + 1) = —©&(?>) for 
k = 1,2. 

7.1.2. Prove that 0fc(u + r) — AO k (v) for k — 2,3 and ©&(u + r) = -^4©fe(h) 
for k — 0,1. 


§7.2. Zeros of theta functions 

Let the numbers m and k be related by the formula 2m —1 — 1 — 2 fc, i.e., A: = 
1 -m. Then (~l) m = - (-l) fc and g( m " 5) 2 = g(M) 2 , so that ©i(-u) = -©i(v). 
In particular, ©i (0) = 0. 

Since ©i(v + l) = -©i(i>) and ©i(u + r) = -j4©i(w), we have ©i(m+nr) = 0. 
Let us show that the function ©1 has no other zeros. It suffices to verify that the 
function ©1 has only one zero inside the parallelogram II with vertices ±l ^ T -. 
Clearly, 


0 ^ + f) = gUu), 

©l(u + l) ©i(v)’ 

Q'ifo + r) = —^4 / (u)©i(u) yf(u)Q^(u) 

©i (v + t) —A(v)Qi (v + 1) 

1 e ;w _ 1 9'iW 

A(o) + ei(o) + ©i(W 

Therefore, if C is the boundary of II oriented counterclockwise, then 


1 

27TZ 



gia 

©i(^) 


dz = 


1 . 


Indeed, the sum of the integrals along the horizontal sides is equal to 2iri, since 
the lengths of these sides are equal to 1 , whereas the integrals along the other two 
sides of II cancel. 

If f(z) -c k {z- a k ) k H-, then + • • •. Therefore, for an entire 

function / the value 

_L f lM dz 
2 ™ Jo m 

is equal to the number of zeros (multiplicities counted) inside C. Therefore, inside 
II, the function ©1 has exactly one zero. 
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Starting from the expressions for the functions 0 O , ©i and 02 in terms of ©3, 
we can easily get the following table for zeros of theta functions: 

function ©o(u) © 1 ( 11 ) © 2 ( 1 ;) ©3 (a) 


its zeros m + (n + |) r m + nr 


m + \ -f nr 


m + i + (n+|)r 


Problems 

7.2.1. Prove that the functions ©o ,©2 and ©3 are even. 


§7.3. The relation ©| = @| + © 4 


The quantities ©,; = ©,; (0) for i — 0,2,3 and ©^ = 0' x (0) are called theta 
constants. Recall that they depend on the parameter r. 

Consider the function 


fiv) = 


q©l(u) + bQl(v) 

©§(«) 


where a, b € C. 


The numbers 1 and r are periods of /; hence, / is a doubly periodic function with 
fundamental parallelogram II. The fundamental parallelogram can be shifted so 
that only one zero of the function ©o(f) lies inside it; assume that it is If we 
select numbers a and b so that a© 2 (^) + 6 © 2 (|) = 0, then, inside II, the elliptic 
function f(v ) has a pole of multiplicity not greater than 1 and, therefore, f(v) is a 
constant. 

Substituting v = ~ and v = 0 , respectively, in the relation 




we get 

02 (D = e_ ^© 3 (0), resp ' ®a(0) = e^© 3 = ^©3 (0 . 

Hence, a©|(^) + 60 2 (^) = aR 2 ©! + 6R 2 © 2 , where B — e~^. Set a = —©| 
and b = © 2 . Then aR 2 © 2 + 6R 2 © 2 = 0 and 


—0 2 (u)0 2 + ©3(^)©3 = c©q(u). 

To compute c, set v — The function ©2 vanishes at this point, so to calculate 
©3(5) and © 0 ( 5 ) we can substitute v — 0 and v — \ in the relation 0 o (u) — 
© 3 (w + |). We get ©o = © 3 (|) and © 0 ( 5 ) — © 3 (1) = © 3 . Therefore, 


©o© 3 c © 3 > 

i.e., c = ©q. Thus, 

© 3 W0!-e2(u)©I = ©§(u)0o. 
In particular, for v = 0 we get 

e| = e| + el 


i.e., 

(l+2q+2q 4 +2q 9 + - ■ • ) 4 = 16q{l+q V2 +q 2 - 3 +q 3 - 4 + - ■ ■ ) 4 + (l-2q+2q 4 -2q 9 + - ■ • ) 4 . 
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§7.4. Representation of theta functions by infinite products 

The numbers m+| + (n+|)r are zeros of the function 

OO 

© 3 (v) = Y, q k2 e 2lxikv . 

k~ — oo 

Set s = e 27rw . This substitution sends the zeros of 0 3 (u) to the points 

e27 ri( m+ i) e27 ri(n+4r) = _ q 2n+l ? where g = ^ir _ 

Let us put these points into the two sets: 

(4.1) - q ~\ -q~ 3 , 

(4.2) -q 1 , -q 3 , -q 5 ,.... 

The limit point of the set (4.1) is oo and the limit point of the set (4.2) is 0. 
The series YlT-i \q' 2k ~ 1 1 converges; hence, the function 

OO 

AM-ill 1 ***- 1 ') 

As=l 

is an entire function of s with zeros at the points (4.1). Similarly, the function 

OO 

Aw-n(i+«“- , «- 1 ) 

k -1 


is an entire function if we consider it as a function of s _1 (as a function of s it has 
a singularity at the origin). The zeros of / 2 (s) are the points (4.2). 

Consider the function f(s) — /i(s)/ 2 (s). The function g(v) = f(e 27riv ) = /(s) 
has the same zeros as the function 0 3 (u). Under the change of variables v v + 1 
the value s = e 2 ™ does not change and, therefore, g(v + 1) ~ g(v). The change of 
variables v v + r replaces s with e 2ltiv e 2 ' nir — sq 2 ; hence, 

g {v + r) = - 1 - H " q S g(v) = q~ 1 e~ 27Tiv g(v). 
t + qs 

Therefore, the ratio of the functions 0 3 (t;) and g(v) is an entire doubly periodic 
function, i.e., a constant. Therefore, 

OO 

(4.3) 0 3 (v) = c JI (1 + q 2k ~ 1 e 2 ' niv ) (l + q* k ~ l e - 2ltiv ) . 

fc=i 

Similar expansions can be obtained for the remaining theta functions. Since 

e 27ri(„+§) _ e -2niv, j t f 0 H 0ws that 

OO 

= c fl (1 - q 2k ~ l e 2 ™) (1 - (? k - l e~ 2 ™) . 

k—1 



(4.4) ©o(a) = 0 3 







§7.4. REPRESENTATION OF THETA FUNCTIONS BY INFINITE PRODUCTS 


Since _ _ q 2 k- 2 e 2 -niv and g 2 fc-l e - 2 iri(t)+|-f) _ _ q 2 k & - 

have 

e 1 (») = <e-’ ri » 9 ie 3 ^ + 5-0 

oo 

= i (1 - e 2iriv ) e-^oU (1 - q 2k e 2 ™) (l - q 2k e~ 2 ™) . 

k =1 

It is also clear that i (l — e 2?rTO ) e~'' r ™ = 2sin7ru, so that 

OO 

(4.5) ©i(v) = 2(sin7 rv)gicU (l - q 2k e 2lTiv ) (l - q 2k e~ 2 ™). 

k =1 

It is easy to verify that 02(w) — €>i(v + |); hence, 

OO 

(4.6) © 2 (u) = 2 (cost tv) gic U (1 + <? 2fc e 2 ™) (l + q 2k e ~ 2 ™). 

fc=i 

Now, let us prove that 

OO 

(4.7) c = n (! - «") • 

fc=l 

Consider the sequence of functions 

F n {s) = fj (1 - g 2/c_1 s) (l - q 2k ~ 1 s~ 1 ) = 52 a k (n)s k . 

&=1 k=—n 

This sequence converges uniformly to the function 

OO - -j OO 

J[ (1 - q^-'s) (1 - g 2fc - 1 s- 1 ) - i©o(v) = - J2 (-1) VV. 

fc=l fc=—oo 


Comparing the coefficients of degree zero in s, we get ~ = lim n ^ >00 ao(n). 
Clearly, a n (ri) — (—l) n g 1+3+ ”' + ( 2n “ 1 ) = (—1 ) n q n2 . Moreover, 

F n (q 2 s) (l - q 2n+1 s) (l - g - 1 s -1 ) _ 1 -q 2n+1 s 

F n (s) (1 — qs) (1 — g 2 n-i s -i) qs — q 2n ’ 


hence, 

n n 

(qs - q 2n ) ^ afcfa)? 2 * 3 * = - (l - g 2 n+ 1 s) 52 a fc( n ) sfc > 

k~—n k=—n 


i.e., 

n n 

52 Ofc(^) (l - g 2 ( n + fc )^ s fc = 52 OA:(n) (g 2n+1 — # 2/c+1 ) S k+l . 

k=—n k=—n 

Therefore, 


a 0 (n) - q n2 


nLi (i - ? 2(,>+t) ) 

IE-, 


nLi (i - « 2< " +t) ) 
IK-Ui-* 2 *) ' 


2iriv 

? 
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Let |g | 2 = a < 1. Then 1 — a n < |l-g 2 ( n + fc )| < 1 + a n . Moreover, 
lim^oo nln (1 ± a n ) = 0. Hence lim^oo (l — q 2 ( n+k )) = 1 and 


c = lim l/a 0 (n) = TT (l - q 2k ) . 

n—>oo 

fc -1 

§7.5. The relation 0^(0) = 7r0o(O)02(O)©3(O) 
Formulas (4.4)-(4.7) imply that 


(5.1) 

(5.2) 

(5.3) 

(5.4) 


0 o ( 0 ) 
02 ( 0 ) 
03(0) 
01 ( 0 ) 


fc =1 

OO 

(1 + q 2k f \ 

1 

OO 

ciit+n 2 ; 

/c=l 

OO 

2nqh (l -g 2fc ) 2 


fc=i 


To prove the last formula, it suffices to observe that 0' x (0) = lim^o . 

Since (l - q 2k ) 2 = c 2 , we have 0^(0) = 2irq^c z . Therefore, to prove the 
relation 0^(0) = 7 r 0 o(O)© 2 (O)© 3 (O), it suffices to verify that 


OO 


ni 1 -^" 1 ) (l + ^ 2fe ) (1 +q 2k ~ 1 ) = 1. 


fc =1 


Clearly, 


II(i-s 2 *- 1 ) = n (i -«”) (i - ? 2 T 1 ; 

k =1 n=1 

OO OO 

n (!+«“) (!+<?—‘)=nc + q n )- 


&=i 


n—1 


Therefore, 


11(1- 5 “- 1 ) (1 + 9 a ) (1 + = II (1 - 9") (1 + 9 n ) (1 - 9 2 ”)' 1 

71=1 
OO 

= II ( 1 -? 2 ")( 1 “ 9 2 ")“ 1 = 1 - 


fc = l 


71=1 
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§7.6. Dedekind’s 77 -function and the functions /, / 1 , / 2 

Formulas (5.1)-(5.4) together with (4.7) enable us to represent the theta con¬ 
stants in the following form: 1 

OO 

®i(0) = 27T?7 3 (t), t/(t) = (1 - q 2k ) ; 

k=l 

bo 

0 3 (O) = rj (r)/ 2 (r), /(t) = q~& JJ (l + q 2k ~ l ) ; 

k—1 

00 

©o(0) = v( T )fi{ r )i fi{r) = q~& JJ ( X ~ 5 2fe_1 ) ; 

k= 1 
00 

©2(0) = r7(r)/ 2 2 (r), / 2 (r) = V 2 g& (l + q 2k ) . 


fc=i 


The relation 0|(O) = @1(0) + ©q( 0) implies that 

( 6 - 1 ) / 8 = /i 8 + /f. 

In §7.5 we have shown that 




k~l 

Hence, 

(6-2) Jhh = V2. 

The functions /, /1 and / 2 can be expressed in terms of 77 as follows: 


(6.3) 

(6.4) 

(6.5) 


f( T ) 


AW = 


v( T ) 


!LM 

7?(t) 


AM = 

rj{T) 


Let us prove, for instance, formula (6.3). It is easy to verify that 

2k 


V u_i. f il±\ 12 7-7 ^ 

~rrfr)~ = 924 " I" 2 ) II- 


( 792 ) 


fc=l 


1 ~q 


■2k 


q 24 e 24 


si (1 + g) (1 ~ tf 2 ) (1 + Q 3 ) (1 - Q 4 ) • • • 


(1 - q 2 ) (1 - q 4 ) ■ • 


— e™q 24 ]^[ (l - < 7 2/s 2 ) . 


fc=i 


It is even easier to prove formulas (6.4) and (6.5). 


1 Observe that the functions /(r), /i(r), / 2 (r) introduced below and the functions /(s), fi (.s), 
f 2 (s) from §7.4 are different functions. 
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§7.7. Transformations of theta functions 
induced by transformations of r 

In §7.1 we have seen how the functions 0* (u|r) behave under the replacement 
of v with v + 1 or v + r. It turns out that under the replacement of r with r + 1 or 
— i the functions ©,;(i>|r) are also transformed according to relatively simple laws. 
For the change of parameter tht + 1 this is no wonder, since under such a change 
of parameter q — e niT turns into q' — —q. Therefore, 

©o(u|r + 1) = © 3 (u|r), 0 3 (u|r + 1) = © 0 (u|t), 


®i{v\r + 1 ) = e™ @i(v\r) fori = 1,2. 


The change of parameter r r' = — however, sends q — e 1TlT into q' — 
e~ 7Tl / T — exp(—7i 2 / Inq). It is surprising that there is a transformation law for 
the theta function for such a change of parameter. To find this law consider the 
function 


g(v) = e mr v 


o^y^w) 

©3(^Jt) 


Simple calculations show that g(v + 1) = g(v) and g(v + r) = g(v), i.e., g is 
a doubly periodic function. The zeros of the denominator are of the form v = 
(to + ^) t+ (n + |) and the zeros of the numerator are determined from the relation 
t'v — (to + ~) t' + (n + i), i.e., 



Therefore, the zeros of the numerator coincide with the zeros of the denomina¬ 
tor; hence, g is an entire doubly periodic function. Therefore, g = A, where A is a 
constant. 

By replacing v consecutively with v + |, v + | and v + , we derive from the 

equation 

(7.1) Qz(t'v\t') = Ae- niT ' v2 9 3 (v\T) 


the following equations: 

(7.2) e 2 (T'v\T') = Ae~ 7riT ' v2 ©o (u|r), 

(7.3) ©o^IrO = Ae~ wiT ' v2 ©2 (u|r), 

(7.4) ©i(r / v|r / ) = iAe~ niT ' v2 ©i(i;|r). 

Taking the product of equations (7.1)-(7.3) and setting ?t = 0we get 

(7.5) ©2(O|t / )0 3 (O|t / )0o(O|t / ) = A 3 0 2 (O|T)0 3 (O|T)©o(O|r). 

The derivative of (7.4) with respect to v yields at v = 0 

(7.6) T , e , 1 ( 0 \T , ) = iA@[. 
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Since©]. — 7 t 0 o 0 2 © 3 , equations (7.5) and (7.6) imply that A 2 = —ir, i.e., 
A = d:\Z~-ir. As a result we get 

©3 (0|t / ) = 7t© 3 (0|t). 

It is also clear that for a purely imaginary r both © 3 (0|r) and 0 3 (O|r / ) are positive. 
Therefore, 


(7.7) 0 3 ^Oj - = \/ 7T0 3 (0 |t). 

Transformations of Dedekind’s 77 -function. Transformations of the theta 
functions as the parameter r varies lead to transformations of Dedekind’s ? 7 -function 
since 

2nrj 3 (r) = ©^(Olr). 

From the relation ©i(0|r + 1 ) = eTT©i(0|T) we get 

(7-8) r}(r + 1 ) = e^rj(r). 

If we take into account that A = ^f^vr and r' = -A, then (7.6) takes the form 

e 'i (°l - 7) = (vQ7) 3 e;(o|r). 

Hence, 

(7.9) 77 = V-irr)(r). 

With the help of formulas (7.8) and (7.9) we can obtain the transformation 
laws for the functions /, fi and / 2 under the change of parameter r *—> r + 1 or 
t 1 —> — —. For example, 


(7.10) /(r + 1) 

Similarly, we get 


. _ 7X1 

e 24 


77 (^) 


ix % 
' 24 


t?(t + 1) 


^( 2 ) 


77(r) 


7X1 

24 


AW- 


(7.11) h(r + 1) = e 

(7-12) /2(t + 1) = e 12 / 2 (r). 

Observe that /(r + 2) = e~^f(r) and, therefore, /(r + 48) = /(r). 

It is easy to verify that 

(7.i3) = = 

V T J V(~r) V-7T77(t) 

Replacing r with - ~ we get 

(7-14) h (-A = Mr). 

Similar calculations do not yield an expression for /(—A). However, if we use 
the fact that 

f(r)fi(T)f 2 (T) = V2 
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we get 

(7.15) /(-;)=/(A 
Now we prove the relation 

(7.16) f(r)f (j^j) = V2. 

Equations (6.4) and (6.5) imply that /i(2t)/ 2 (t) = y/2. Since 

/i(2r) = e“S/(2r- 1) 

and 

/ 2 (r)=/i^-ij=eS/^--ij, 

then /(2r - l)/(-^) = \/2- Set x = 2r - 1. Then 1 - J and the obtained 

relation is equivalent to (7.16). 


7.8. The general scheme of solution of quintic equations 


The remaining part of this chapter is directly concerned with the solutions of 
quintic equations. The construction is rather complicated and is based on various 
facts whose proofs often require cumbersome calculations. To help the reader grasp 
this construction we first describe the general scheme: what is done and to what 
purpose. 

Let f(r), /i(r), / 2 (t) be the functions defined in §7.6. Set 


u - /( t ), 



and Voo — /(5r). 


The study of the behavior of u and v under the changes of parameter tht + 2, 
r i-» —t _1 and r t—> shows that these changes transform uv and ujv as follows: 



UV 

u 


V 

T 1 — > T + 2 

e ~m/2 uv 

piri/Z u 

V 

T 1 —> —- 

UV 

u 

T 

V 

T T+l 

— JL 

_ v_ 

UV 

u 


The transformation law of the index c of the function v c is the same as that of 
r. Now it is not difficult to demonstrate that 



or, equivalently, 

u 6 — u 5 v 6 + Auv + u 6 = 0. 

Let us consider u = /(r) as a parameter. For every value of this parameter we 
get a sixth degree equation whose roots are explicitly expressed in terms of r. We 
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would like to do the same for the equation y 5 + by = a. It turns out that this can 
be done as follows. Consider the fifth degree polynomial with the roots 


( 8 . 1 ) 


w 7 


_ (Voo ~ V Z )(V Z+1 - V z -!)(v z+ 2 ~ U z _i) 


where 2 = 0,1, 2,3,4. 


u 


Calculations will show that this polynomial is of the form 

w(w 2 + 5) 2 — u 12 + 64u -12 . 


After the substitution 

( 8 . 2 ) 

we get the equation 


V(r) 


fi( T ) — /I ( r ) 

/ 2 (t)(u> 2 (t) + 5) 


y +5y 


f! 


/! 


P 


Thus, to solve the equation y 5 + 5y — a, we have to proceed as follows. First, let 
us find a r for which /f (r) - /|(r) = a/ 2 (r). This problem reduces to solving a 
quadratic equation (and calculation of the inverse of /). Then we compute w z (r ) 
by formula (8.1), and compute y z (r) by formula (8.2). These are the roots of the 
equation considered. 

We begin with studying the behavior of u and v under the changes of param¬ 
eter r (§§7.9-7.12). This is precisely the part of our program connected with the 
most cumbersome calculations; the theoretical background for these calculations is, 
however, quite simple. 


To the matrix 
mation 


a b 
c d 


§7.9. Transformations of order 5 

we can assign a fractional linear or Mobius transfor- 
ar + b 

T I > --. 

CT + d 

Under this assignment the transformation r f—> r + 1 corresponds to the matrix 

/1 i ^ 

and the transformation r i corresponds to the matrix S = 


T = 

f 0 


0 1 
-V 


1 0 J - It is easy to verify that the transformation r h-> A(Bt) corresponds to 

f -l 0 ' 

, 0 -1 


the matrix AB and to the matrix E 


there corresponds the identity 


transformation. Formulas (7.8) and (7.9) can be rewritten as follows: 


(9.1) n(Tr) = e«T,(r). 

(9.2) v(S T ) = V~ir‘v{ T )- 

Let 57.72 (Z) be the group of integer 2x2 matrices with determinant 1. One 
can show that the group SL 2 (Z) is generated by 



and 
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This means that if A G SL 2 ( Z), then r/(Ar) can be expressed in terms of r/(r) 
with the help of formulas (9.1) and (9.2). This explicit expression in terms of 
the elements of the matrix A was first obtained by Dedekind and this is why the 
function 77 (r) is called Dedekind’s ^-function. 

One can also obtain a similar expression for /(At), but it involves not only 
/(r) but also /i(r) or f 2 (r). For example, 


f(Tr) — e 24/i( t ). 


We will only need these expressions for certain matrices A. We will prove these 
expressions separately and will not deduce a general formula for them. 

The solution of the fifth degree equations is based on the study of transforma¬ 
tions /(r) 1 —* f(Pr ), where P = ^ ^ is an integer matrix with determinant 

equal to 5. If A G SL 2 (Z), then f (APr) can be expressed in terms of f(Pr ) (or 
fi(Pr) for i = 1,2). Therefore, one has to find the simplest form to which P can 
be reduced by left multiplications by a matrix A G SL 2 (1j). 

Let c and d be relatively prime numbers such that cp + dr = 0. There exist 
integers a and b such that ad — be — 1. 

This means that 


(“ d) eSL ^ 


fa b\ fp q\ 
\c dj\r sJ 


where p's' = 5. Using the fact that 


fp' Q 1 
V 0 s' 




(1 n\ fp q\ _ fp q + ns\ 

VO 1 ) V° s ) v° s )' 


we can reduce the matrix P to the form ^ ^ ^, where — §<<?<§■ As a result 

we see that P can be reduced to one of the following forms: 


Pr, 



Po 



p± 1 





Let v c = f{P c r ), i.e., Uoo(r) = /(5r) and vjr) = /(r + c/5) for c # 00 . Let us 
try to find out how v c behaves under the change of parameter r t— Bt. This change 
of parameter corresponds to the right multiplication of P c by B. By multiplying 
P C B from the left by a matrix A G SL 2 {%) we can obtain the matrix P d , i.e., 
f(P c Br) can be expressed in terms of v d = fiPdT) ( or fi{P.dT))- 

The next three sections are devoted to the derivation of these expressions for 
the changes of parameter th->t + 2 , th and r h-> —j. 


§7.10. The change of parameter r h r + 2 

The change of parameter r t—> r + 1 sends f(r) into (t). Therefore, in 

order not to appeal to the function / 1 , let us confine ourselves to the change of 
parameter r 1 —> r + 2. This change of parameter sends, for example, /(5r) into 
/(5r + 10) = e~Tr/(5r) and /(f) into /(^). 

These transformations look different: in one case we have a factor of e~ ^ , 
whereas there is no factor in the other case. The point is that our notations are ill 
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adjusted, Selecting the functions v c more adequately we can make all transforma¬ 
tions look alike. 

As was noted in §7.7, the period of the function / is equal to 48. Therefore, 
varying the integer c we only have 240 distinct functions /(Residues modulo 
240 are divided into 48 five-tuples consisting of numbers with the same residue 
modulo 48. Select the numbers c from the same five-tuple by demanding that they 
be divisible by 48. This can be done by changing the notation: 



where c = 0 (mod 48). We directly see that v c — v d if c = d (mod 5). This means 
that for the index of v we may take the residue after the division of c by 5. Since 
48 = —2 (mod 5), it follows that 


^±2 f 


(T 5 ) 


) "^±1 ■ f 


r ± 96 \ 

5 )' 


For vo and v^ we preserve the previous notation, i.e., vq — f(^) and Vqq = /(5r). 
This is the notation that we will use in the sequel. The change of parameter 
tht + 2 replaces the function v c with 


j, /r + 2 + c\ 7r + 50 — 48 T- c \ _sjg 


f 


f i / ' 

T + C 


where d = c — 48 = 0 (mod 48) and d = c + 2 (mod 5). Thus, v c is replaced with 
e~Tlr v c+2 and this also holds for c = oo. Miraculously, the index c is transformed 
by the same rule as the parameter r. In the next two sections we will see that this 
is also true for the changes of parameter and r i—> . 


§7.11. The change of parameter r i—> — 1 

The change of parameter th- 1 replaces the function Voo = /(5r) with 


It is also easy to verify that v 0 is replaced with v Xl . Therefore, v c is replaced 
with u_i/ c for c = 0, oo. For the other values of c this is also true but the proof 
requires cumbersome calculations. 

To figure out the behavior of the functions v c for c / 0, oo, we have to represent 
the matrices from SL 2 (Z) in the form of the product of the matrices S and T. This 
is most easy to perform for matrices with small elements; therefore, we will use the 
fact that 


V±2 ~ f 
v±l = f 


' T qF 48' 
5 

V ± 96' 


= / 
= / 


V±2 

5 

V 4= 4 


57ri 


10 = 12 / 


t ± 2' 


±2oY=e=f s S‘/('lTi' 


The change of parameter r — ~ induces maps of functions 


v ±2 •—> e 


T 57 ri 
12 


/ 


/ ± 2 r — 1 


V 


5r 


and v±i t-^ e^ 5 ® 1 / 


( =p4r - 1' 


v 


5r 
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Let us start our calculations with the function ) = v . 2 - For this function, 

to one of the six main forms. 


-2r —1 


we have to reduce the fractionally linear function 5r 
Following the reduction algorithm described in §7.9 we get the relation 


2 1 
-5 -2 


Therefore, 


where 


Hence, 


/ 


— 2 r — 1 


f 


5 r 

A ' 1 = ( 

( — 2 r — 1 
\ 5 r 


-2 -1 
5 0 


/ U" 


(o ' 

V — 2' 


2 1 
-5 -2 


f ST ST~ S 


= -ST 2 ST~ 2 S. 
r-2V 


It is easy to verify that 

f (ST 2 ST~ 2 S (3) = f(T 2 ST~ 2 S(3) = e~&f(ST~ 2 S(3) = e~^e^f(J3) = f((3). 
Therefore, the function /( T + 48 ) i s replaced with 


__ P.TTL « 

e 12 / 


> - 2 ' 


57ri 


= e 12 / 


V + 48 


10 =/ 


V + 48' 


Similar calculations demonstrate that the function /( r = ^ --) goes into /( T ^ •)■ 
Hence, u c turns into n c for c — ±2. It is also clear that —1 = c for c = ±2. For the 
function calculations are somewhat lengthier. Since 


4 3 

5 4 


-4 -1 
5 0 


-1 -4 
0 -5 


and 


it follows that 


4 3 

5 4 


-l 


= stst~~ 4 st~ 


f f .— "j = / ^stst~ 4 st~ 1 ^ 


r + 4' 


It is easy to verify that 

f(STST~ 4 ST~ 1 P) - f (T ST~ 4 ST~ X (3) = h(ST~ 4 ST~ l $) 

= e~^f 2 (T- 4 ST- x i3) - e- 9 ^f 2 (ST- 1 p) 

= e-^f 1 (T- 1 0) = e-^m. 

Therefore, the function /(--F^) becomes /(+|p-). Clearly, 


/ 


V + 4' 


= / 


96 


5' 


+ 20 


_ Q7TZ 

e 6 


/ 


r — 96 


Since | + | + | = 2 and e 27 ™ = 1, we see that v\ — /(-^p) turns into /(—g^) = 

v-i. 

Similar calculations show that u_i becomes Vi. 
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Thus, the change of parameter r — ~ sends the function v c into v_i for all 


§7.12. The change of parameter r h* 

Let us prove that the change of parameter r (->■ sends the function v c into 
where d = In the proof of this statement the relation (7.16), i.e., 


f(r)f 


T — 1 

r + 1 


plays an essential role. For the calculations we need a relation of the form 


( 12 . 1 ) 


1 -1 

1 1 


x y 
0 2 


a j 3 \ ( a b\ f 1 — 1 
7 <5 J \ 0 dill 1 


where the matrix ^^ ^J is given and the matrix G SL 2 (Z) is selected 

so that the lower left element of the matrix is indeed 0. By multiplying 


( 12 . 1 ) from the left by 


1 1 

-1 1 


we reduce this condition to the following one: 


{—a + 7 ) (a + b) + (6 — (3)d — 0 . 


The relations for vq and are the easiest to obtain; they are of the form 


1 

1 1 ) 
(1 - 1 
V l 1 


1 -4 
0 5 

(1 4 
VO 5 


1 —4 
0 1 


5 0 1 -1 

0 1 j I 1 1 


1 0 ) f 1 01 -1 

-4 lj U 5 1 1 


Therefore, 


y/2 4-n-i f T 1 \ _ 4 Tri . 

7 ( 3 ) - “ / ( 7 —) ’ iwr 6 '" 1 

where in deducing the second formula we used the fact that 


4 P ( 1 T — 1 \ 

e “T57TI ■ 


-4r + 1 


4r — 1 


_ 47ri 

e 24 ' 


Taking into account that e 


T we get 


■ t- 1\ —y2 

, T + 1 ) f(rfey 


f —- 

J \5 r + 1 


/W 


The relations for v\ and 77 are of the form 


( 12 . 2 ) 


1 -l\ / 1 0 
1 1 J \0 5 

1 -l\ (5 0 

1 1 J lo 1 


3 -1 

—2 1 


1 l\ /I -1 
0 5 ) l 1 1 


3 1 Wl -1 

2 1 j l 0 5 


1 -1 

1 1 


(12.3) 
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Let us perform the calculations for v \. Under the change of parameter r i—» ~~ 
the function v\ = /( T + 96 ) becomes 

/a + 96\ f a + 1 \ 19 . /a + l\ _.ii>«■..* £ , 0 . 

f \ 6 ) = f V - i - + 19j = e J=e - AW), 

where a = and /5 = ^dd. Relation (12.2) means that 


V 2 f 3/3-1 \ 

/(f) U-2-0 + 1; 


It is easy to verify that 
3/5-1 


/ 


-2/5 + 1 


= ea</i 


0 


-2/5 + 1 


= e24 / 2 2 


0 , 


= e^h{(3). 


Hence, 


e 


197rj 

24 


SM = 


—e 


5?ri 

24 


/i(/J) = 


V2 

/(I)’ 


i.e., the function ui turns into 
that V 4 becomes — — — — 

* f(5r) Doo 

For V 2 we get the relation 


y/2 

/(f) 


y/2 
Vo ' 


Similar calculations demonstrate 



and then it is not difficult to verify that v? turns into — We similarly prove 
that V 3 turns into — —. 

° t>3 

The results of the calculations performed in §§7.10-7.12 can be arranged in the 
following table, where e — e~ 1 2 ?: 



U 

Vqo 

U) 


V2 

V3 

V 4 

tht + 2 

7T i 

e 12 n 

evoo 

£^2 

£+s 

£V4 

£v 0 

£V\ 

T 1—> T — — 

T 

u 

Vo 

Voo 

U 4 

V2 

V3 

Vl 

r 1 — > T 7^ 

+2 


__U2 

_ +2 


_^2 

_V± 

r+l 

u 


t>4 

v 0 

V2 

V3 



§7.13. Functions invariant with respect to the 
changes of parameter tht + 2, th — ~ and r 1 —> 

The function / 24 (r) is fixed under the changes of parameter r h r + 2 and 
TH-1, whereas under the change of parameter r 1 —> (fF-y it turns into ■ 

If we consider the function 
012 

nr) = f\r) + 1W) 

= ? _1 n (! + q 2 k ~ 1 f i + 2 12 9 JI (1 + ? 2t - 1 )" 24 = q - 1 + 24 + • • • , 
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then, as is easy to verify, it does not change under the transformation of parameter 
r i * as well. Indeed, this change of parameter turns jtkjy -y into / 24 (r). 

We can prove that if the function g(r) is meromorphic in the upper half plane 
Imr > 0 and does not change under the transformations tht + 2, th-I and 
r l— » yqq, then, under a certain condition, g(r) — R(F(r)), where R is a rational 
function. The condition in question is that the function g(q), where q — e KlT , 
should be meromorphic. For a meromorphic function g(r) the condition reduces to 
the fact that the point q = 0 is not essentially singular, i.e., there are only finitely 
many nonzero coefficients c n with negative n in the expansion g(q) — Y^L-oo c nQ n - 
(§§7.16-7.19 are devoted to the proof of this statement.) 

Moreover, the equation F(r) — c is solvable for all complex c =4 0 (we will 
prove this in §7.19). If R(F(r)) is finite for all r such that F(r) / oo, then R is a 
polynomial. Indeed, let R be a fraction with a nonconstant denominator. Then its 
denominator vanishes for some F(t) ^ oo. If R(F) is finite for q = 0 as well (i.e., 
F — oo), then R is a constant. 

We will use these facts first to deduce the modular equation and then to solve 
quintic equations. 


§7.14. Deduction of the modular equation 

With the help of the transformation table (see §7.12) we can easily verify that 
the transformation law of the functions uv and u/v, where u(r) = f(r), is as follows: 



UV 

U 

V 

tht+2 e 

■rci ni ?/ 

2 uv e 3 - 

V 

T 1—> T — — 

T 

uv 

u 

V 

T T—1 

— JL 

uv 

_ V 

u 

Under these transformations the index c of the function 

way as r. 

Consider the functions A c = (u/v c ) 3 
They transform as follows: 

+ (v c 

/u) 3 and B c 


A 

B 

tht + 2 

- A 

-B 

TH-1 

r 

A 

B 

T I—* T ~_\ 

T+ 1 

-A 

-B 


As a result we see that the function (A c — B c ) , where the product is taken 
over c — oo,0,l,2,3,4, does not vary under all the above transformations of r, 
since these transformations only permute factors. 

It is easy to see that 

^oo — q 2 (1 — 2 # + •••), Bqo — q 2 (1 — 2q + • • -). 
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Hence, Aqq — B !yc vanishes at <7 = 0, i.e., as Imr oo. Let us demonstrate that 
A c - B c vanishes at q — 0 for all c. Clearly, 

) = f ( T ) = U ( T )- 

(Recall that in these formulas c - 0 (mod 48).) Moreover, A and B do not vary 
under the interchange of u with v. Hence, 


u(5r — c) — f (5r ) = Voo (r ) , u c (5r - c) = / ^ 


5r — c + c 


(14.1) A c (5r - c) = ^oo(r), B c (5t - c) = ^(r). 

Since the conditions Im(5r — c) — > oo and Imr —> oo are equivalent, it follows that 
A c — B c — 0 vanishes at q = 0. 

As a result, we see that the function (A c — J5 C ) 2 is a constant that vanishes 
at q — 0. Therefore, A c — B c = 0 for some c. Formulas (14.1) show that then 
A c — B c ~ 0 for all c. Therefore, 


i.e., 

(14.2) 



(uv ) 2 — 


4 

7 A ’ 

(m>) 


u 6 — u 5 v 5 + 4 wr; + u 6 = 0 . 


Equation (14.2) relates w — /(r) with v = /(5t); it is called the modular 
equation. Fixing u — f(r) and considering (14.2) as an equation for v , we find that 
its roots are Voo — /(5r) and v c = /( 2 y £ ) for c = 0 (mod 48). The Fieta theorem 2 
implies, in particular, that 


(l 4 - 3 ) JJu c = w 6 . 

We will need this relation to solve the quintic equations. 


§7.15. Solving quintic equations 

In §7.14 we have shown that the coefficients of a sixth degree polynomial with 
roots v c can be expressed in terms of u. Let us now prove that the coefficients of a 
fifth degree polynomial with roots wq,Wi, W 2 ,W 3 and W 4 , where 


(15.1) 


w z 


(Vqo Vz){v z +1 Vz — l'){Vz+2 ^ 2 — 2 ) 

VEu 3 


can also be expressed in terms of u and find the explicit form of this polynomial. 

With the help of the transformation table for v c we can compose the following 
transformation table for w z : 



Wq 

W\ 

W 2 

w 3 

W4 

THT + 2 

—W2 

~W 3 

—W4 

-w 0 

—Wi 

T 1 —> — - 

T 

W 0 

w 2 

W\ 

W4 

Ws 

T — 1 

r t—► ~r 

T+l 

-Wq 

-w 3 

-W4 

-w 2 

—Wi 


2 In Soviet mathematical schools the Vieta theorem was the statement that if x\ and X 2 are 
the roots of the quadratic equation x 2 + bx + c = 0, then x\ +X 2 ~ —b and xi X 2 — c. Translator. 
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To calculate how w z transforms under the change of parameter r z - 1 - , we must 
use relation (14.3). 

Consider the polynomial 

(15-2) n<- ~ Wi ^ = S" A 2 w 3 + A 3 W 2 + A 4 W + A 5 . 

Its coefficients are finite at u ^ 0, 00 . Moreover, the functions A 2 , A 2 , A 2 , A 4 
and A 2 do not vary under the changes of parameter rt—> r + 2, r h -I and 

r t-» ~j. Therefore, they are polynomials of u 24 + 2 u u ~ 24 = q~ x + 244_.""such 

a polynomial is nonconstant only if its power series expansion in q begins with the 
term cq r , where r < — 1. 

Let us calculate the first term in the expansion of w z . Since 

OO 

/( r ) = Q~ n J7(l + <? 2/c_1 ), where q = e mr , 
k=l 


,57r ir 


for 


OO 


the first term in the expansion of v c is equal to (q') ^, where q' — e 

and q' — e 5 for c ^ 00 . It is easy to verify that e~"^t = a c , where a — e~^ L . 
(Recall that csO (mod 48).) Therefore, the first term in the expansion of w z is 
equal to 

JL-1— / 5-4-1 »_n_L_ / ~.a o 

a ) i —L 

-— = A q 10 


q 24 q 120 — a 


— rv - 2 !) q 120 


where A 


,2 * 


(a 3 —a—a J +a 3 ) 

7E 


V5q s 

- a 2z , because 


a 3 -a-a 1 + o+ 3 = 2 (cos36° + cos 72°) = V5. 


Thus, the expansion of A s begins with q~w ( or with the term cq r \ where 
r > “fo)- Therefore, the functions Af, A 2 , A\, and A 4 are constants whereas A\ 
linearly depends on u 24 +2 l 2 u ~ 24 = q~ J +24+- • • because its expansion begins with 
where /x - ( a 2 a 4 a 6 a 8 ) 2 = 1. Comparing the first terms of the expansions 
of the functions u 24 + 2 x 2 u ~ 24 and A 2 we get 

A l = u 24 + ~+C. 
rU 4 

To calculate the constants C, Ay, A 2 , A3 and A4, it suffices to calculate the value 
of v c for one r. Most convenient for calculations is r = i. Indeed, -1/i — i and by 
(7.14) we get 

(!5-3) /i(i) = /a(0- 


It is also clear that for a purely imaginary r the functions /, /1 and f 2 assume 
positive values. Therefore, (15.3), ( 6 . 1 ) and ( 6 . 2 ) imply that u = /(«)' = \/ 2 . Using 
the fact that (2 — x )(2 + i) — 5 we get 


+5 = / 


i + 48 


= f 


'i- 2 


+ 10 


1 0 7T t 

24 


/ 




Similarly, V 4 = i \/ 2 . 
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For t — i the modular equation takes the form 


(15.4) v 6 — a 5 v 5 + a 9 v + a 6 = 0, 

where a — \/2. We have already found two roots of this equation, namely, v 3 — —ia 
and v 2 = ia. Dividing the polynomial (15.4) by (v - v 2 )(v - v 3 ) = v 2 + a 2 we get 

v 4 - a 5 v 3 + a 2 v 2 + a 7 v + a 4 = (v - a) 2 (v - ft) 2 , 

where a+0 — a and a,6 — —a 2 . Assuming that a > 0 and (3 < 0 we get a — 
and /3 — ■- 1 . It is also clear that 

Uoo = /(5i) = /(—l/5i) - /(i/5) = u 0 

and we additionally have > 0 since f(r) > 0 for any purely imaginary r. 
Therefore, 


^0 ~~ ^oo — 


V2{l + Vs) 


v x =v 4 = 


</2 (1 - \/5) 


2 ” 2 
Substituting the value of v c at t — i into (15.1) we get 

wo = 0, Wi = w 2 = iVb, w 3 = w 4 = —iV 5 . 

Therefore, for r = i the polynomial (15.2) is of the form 

w(w - i\ZE) 2 (w + is/ 5) 2 == w(w 2 + 5) 2 . 

Thus, A|(i) = 0, i.e., 


<7 = - 


(" 24 ( i ) + ^)=-( 2 6 + 2 6 )=-27 


Hence, 


i.e., 


Ai = u 2i 


Ol2 / 06 

-2 7 =(u 12 - 


u 


24 


U 


2 

12 I ’ 


4 , = ± - A 


u 


12 


It is easy to verify that the expansions of functions A 5 


12 


-wqWiW 2 w 3 w 4 and 


— 2 6 u 12 begin with —<p and respectively. Hence, A$ = —it 12 + 2 6 u 12 


u 

and, therefore, equation (15.2) is of the form 

w (w 2 + 5) 2 = u 12 - 64it~ 12 . 
The relations f 8 = ff + /f and ff x f 2 - \/2 imply that 


u 


12 64 __ / 24 (t)-64 

12 


It 


/ 12 ( t ) 


' /l ( r ) — /l( r ) 

, / 2 W . 


Hence, 

Set 

(15.5) 


\/w(t) ~ ± 




y(r) 


/ 2 (r) (tu 2 (r) + 5) 

/f ( r ) ~ /I ( T ) 

/ 2 (r) (w 2 (r) + 5)' 
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Then 

y 5 + 5y = y ( W 2 + 5) = ll.yj 2 . 

If (/f — /|) /f 2 — a, then the roots of the equation 

(15.6) y 5 +5y = a 

can be calculated as follows. First, calculate v c (t) for c = oo, 0, 1, 2, 3, 4. Next with 
the help of formula (15.1) calculate w z (r) for z = 0, 1, 2, 3, 4. Finally, calculate 
y z (r) for z = 0,1, 2, 3, 4 by formula (15.5). These are the roots of equation (15.6). 
To learn how to solve equation (15.6) for any value of the parameter a, we only 
have to learn how to solve equation 

(15.7) /f ( r ) — /! (r) — o/ 2 (r). 

Taking into account that /f + /f = f 8 and //1/2 — \/2 and squaring (15.7) we 
reduce it to the form 

(15.8) / 24 — a 2 / 12 — 64 = 0. 

Equation (15.8) is a quadratic equation for f 12 . One of its roots gives a solution 
of equation (15.7), the other root gives a solution of the equation obtained from 
(15.7) under the change of parameter a —a. 

* * * 


In §7.13 we have formulated certain properties of functions invariant under the 
changes of parameter rnrf 2,TH-i and r (-»• We used these properties 
to solve quintic equations. Let us now prove these properties. 


7.16. The main modular function j(r) 


In this section we will construct a function j(r) invariant under the changes 
of parameter r h r + 1 and t h — k. The function j (r) is of interest thanks 
to the fact that any function invariant under these changes and satisfying certain 
meromorphicity conditions can be represented as a rational function of j. 

First, consider the functions 


Hr) 


9|(0|t) 

e§(0|r)’ 


k'(r) 


eg(Q|r) 

9|(0|t)' 


They are related by equation k 2 + k' 2 = 1 (cf. §7.3). The transformation 
formulas of the theta functions with respect to the parameter r obtained in §7.7 
enable us to verify that 


k{T+1)=z Wr k ' (T+l) = WY 

k(— 1/r) = k'{r), k'{— 1/r) — k{r). 

Now, consider the function A(r) = ( k'{r )) . The changes r t + 1 and 
t sen( ] ^ i n t,o A -1 and 1 — A, respectively. We want to obtain a function j (r) 

invariant under these changes of parameter. To this end it suffices to construct a 
rational function of A invariant under the interchanges of A with A -1 and 1 — A. 
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Such a function can be obtained as follows. Under the action of the above 
transformations, the number A can become .one of the following numbers: 


A, 


1 

A’ 


1 — A, 


1 A A-1 

1 - A’ 1 - A’ A 


Therefore, the function 

Ji(A) = A 2 + ~ + (1 - A) 2 + + (tTt) + (ht) 

is invariant under the transformations indicated. 

The two other functions obtained from J\ by linear transformations, namely, 


J\ + 3 (A 2 — A + l) 

2 A 2 (1 - A) 2 


and 

T t 27 ((A + 1) (A — 2) (A — T 

j 3 = j 2 - T = y --J , 

are often more convenient. The function J 2 is, actually, the most convenient one 
for our purposes. Set 


j(r) = 2 8 J 2 = 2 


, 8 (1 ~ A + A 2 ) 3 _ 8 (1 - k 2 k' 2 )' 


k A k' A 


A 2 (1 - A) 

8 ( 0 I-0^) 3 _^ 8 (0I~04©4) ; 


© 8 © 1©1 


(2tt) 


©i 8 




3 __ (/ 24 - 16) 11 


f 24 


24 

Using the fact that / 24 (r) = g _1 J] (l -f g 2fc_1 ) we can obtain the following 
expansion for j: 


3p) 


= (r - p ' 5 =?- 2 n ( 1 + 9 2 *- 1 ) 48 - 48 ,-' n ( 1 +,“-') 24 

o° 

+ 3 • 16 2 - 16 3 q]J (1 + q™- 1 )- 24 = q ~ 2 + 744 + ^c n tf n . 


The function j (r) is invariant under the changes of variable r h r + 1 and 
r h —1 Since the change of variable r h r + l sends q — e KlT into —q, the 
function j(q) is even. Hence, all the coefficients c n with odd indices vanish. 


§7.17. The fundamental domain of j(r) 

The function j(r) is defined on the upper half plane H = {r € C ] Imr > 0}. 
At the points obtained from each other under the transformations r —> r ± 1 and 
r H4 -1 the values of j coincide. 

T J 

Let us show that the set D — {r € H\ |r| > 1, |Rer| < 1/2} possesses the fol¬ 
lowing two properties: 

1) Any point r H can be sent into a point r' € D by a composition of 
transformations thtH and r ^ -1. 

T 

2) No two distinct inner points of D can be transformed into each other by a 
transformation indicated in property 1). 
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Let G = S.L ‘2 (Z) /{±I} be the group of fractional linear transformations of the 
form 

where (“ d) e “AZ), 

i.e., a,b,c,d€ Z and ad - bc = 1. 

The transformations S : r — and T:tht + 1 belong to G. Therefore 
the group G' generated by S and T is a subgroup of G. This remark enables us 
to prove that for a fixed r G H one can select among all points g'r, g' g G\ the 
one with the maximal imaginary part (and it is impossible to select one with the 

minimal imaginary part since lim„^ )CO Im =0). If g £ G, then 


Im(^r) = Im 


/ ar + b 
\ct + d 


— Im 


adr + bcr 
cr + d 


Imr 
cr + d\ 2 


Therefore, the inequality Im (gr) > Im(r) is equivalent to the inequality |cr + d| < 1 
which is only true for finitely many pairs of integers (c, d). Thus, Im (gr) takes 
finitely many values larger than or equal to Im(r) (although each such value is 
taken for an infinite set of elements g £ G). As a result we see that in the search 
of an element g £ G for which the imaginary part of gr is maximal we can confine 
ourselves to a finite set of elements. These arguments hold not only for the whole 
group G but for any subgroup of G as well. 

Let r' have the maximal imaginary part among the images of r 6 H under the 
G'-action. Since the transformations r'wr'ildo not change the imaginary part 
of r y , we may assume that |Rer'| < Let us show that in this case r' G D , i.e., 
|r / | < 1. Indeed, the condition Imr' > Im {g'r 1 ) implies, in particular, that 


Imr' > Im 




i.e., It'| > 1 . - 

Now we prove a refined version of property 2). Namely, we show that under the 
action of the elements of G' the boundary points of the domain D are identified as 
shown on Figure 42, i.e., the rays QP and SP are glued (under the transformations 



Figure 42 
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r r ± 1), the arcs ' QR and ^ SR are also glued (under the transformation 
r I—» — i). No other boundary points are glued. Observe that if the infinite point 
P belongs to D , then after the indicated gluing D becomes a sphere, i.e., j can be 
considered as a function on a sphere. 

In the following section we will show that j defines a one-to-one map of this 
sphere onto CUoo. 

Suppose that r' — g'r, where g' € G'. and let r and r' be two distinct points 
of D. We may assume that 


, ar + b 
9 r 


ct + d’ 



e SL 2 (Z). 


First, let us consider the case c = 0. In this case ad = 1, i.e., the transformation 
is of the form r i—> r ± b. For case b = 0 the points r and r' coincide. If b ^ 0, then 
the image of D intersects with D only for transformations r t—? r ± 1, and their 
intersection belongs to the set |Rer| = ~. 

Now suppose that c ^ 0. Then 

, ar + b a 1 

cr + d c c (ct + d) ’ 

i.e., 


(17.1) 



The numbers ~ and ~ are real, and, therefore, the imaginary parts of the numbers 
r' — ^ and r + ^ are equal to the imaginary parts of the numbers r' and r, and, 
therefore, the absolute values of r' and r are not less than . Hence, |c| < -%=. 
Since c is a nonzero integer, we have c = ±1. Therefore, (17.1) can be expressed 
in the form It' =j= a\ • |r ± d\ = 1. If r G D and m € Z, then \r + m\ > 1, where the 
equality takes place in the following cases only: 

1) |r| = 1, m = 0; 

2) r — exp(m/3 ), m — 0, —1; 

3) r = exp(27T7/3), m — 0,1; 

Therefore, if c / 0, then only points whose absolute value is equal to 1 can be 
glued; moreover, to glue to the points other than exp(A)7ri/3) is only possible via 
the map th- 1. It is also possible to glue to the points exp(/c7ri/3) via the maps 
TH-i±l. 

T 

Let us notice that we have proved one more refinement of property 2). Namely, 
if t is an inner point of D and gr G D, where g E G, then g is the identity trans¬ 
formation. With the help of this property we can prove the following statement. 


7.17.1. Theorem. The group G = SL 2 (Z) /{±I} is generated by elements S 
and T, i.e., G' — G. 


PROOF. Let g be an arbitrary element of G. Consider an inner point To of D. 
Since 

lm(gr) ImT 


|2 ’ 


|ct + d\‘ 

it follows that qtq G H. Therefore, there exists an element g' of G' such that 
9' (9 T o) G D. As a result we see that the inner point tq of D turns into a point 
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of the domain D under the transformation g'g (z G. Hence, g'g is the identity 
transformation, so that g = ( g / )” 1 € G'. □ 

If we wish the domain D to have no distinct points obtained from each other 
under transformations from G, we can achieve this, for instance, by excluding the 
ray PQ and the arc — QR from D (the points P,Q and R themselves are not 
excluded). The obtained set is called the fundamental domain of the group G. If 
we do not distinguish the points obtained from each other under the transformations 
S and T, then the domain D itself can be called the fundamental domain. The 
fundamental domain can be endowed with the structure of a topological space, since 
it is the quotient of H modulo the G-act ion. 

The value of j at any point r <E II coincides with the value of j at the corre¬ 
sponding point r' € D. Moreover, as we will show in the next section the values of 
j at distinct points of the fundamental domain of G are distinct. This is why D is 
also called the fundamental domain of the function j. 

§7.18. How to solve the equation j(r) — c 

The function j (r) = q~~ 2 + 744 H-has no singularities in the upper half plane 

H = {t | Imr >0}. Hence, the function j has no poles in the upper half plane H. 
For Imr = oo the function j (r) is infinite. 

We start solving the equation j(r) — c by computing the values of j at the 
“vertices” of the fundamental domain, i.e., at the points i,e = and e 2 . Recall 

that j{r) = (V 16 - if) and f{i) = f/2 (cf. §7.12). Hence, j(i) = 12 3 = 1728. To 

calculate j(s ) = j(e 2 ), we can make use of the fact that s ~ 1 — A. Indeed, this 
relation immediately implies: 

m = /(>-))= e-®/ 2 (£), /i(0 = /, fl - I) = e-»/(e). 

Hence, -Jl — f{s)fi(£)f 2 (e) = f 3 {s ) and, therefore, f 24 (e) = (v^) 8 = 16. Hence, 

j(e) = 0 . 

7.18.1. THEOREM. The function j(r) takes every value c in the fundamental 
domain exactly once. 

PROOF. First suppose that c is distinct from 0 and 1728. If the value c is 
taken on the boundary of the fundamental domain, we can modify the fundamental 
domain as indicated in Figure 43 (on the next page). More exactly, if the value c 
is attained at a boundary point, then we cut a neighborhood U of this point and 
attach to D the image of U under one of the maps S or T ±x . Such an operation 
is impossible to perform for the “vertices” but we cannot encounter them since 
c ^ 0,1728. 

If the meromorphic function g(z) has no poles in G and no zeros on the bound¬ 
ary dG, then the number of zeros of g inside G is equal to 

J_ [ iPl dz 

2« Jeo g(z) ' 

Indeed, let g(z) - c 0 (z - a) r + c^z - a) r+1 -j-. Then 

9 '{z)/g{z) - r(z - a)” 1 + • • ■ . 
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Figure 43 

Therefore, the sum of residues at the singular points of g' jg belonging to G is equal 
to the number of zeros of g (multiplicities counted). 

For G take the domain plotted in Figure 44, and for g take the function j(z) — c. 



Figure 45 
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It is easy to verify that 


f j'( z )dz 

f p 2 f(z)dz r ia ~ 

JdG j( z ) ~ c „ 

’Pi j( z )~ c Jia+ 


dln(j(z) - c). 


Indeed, j(z + 1) — j(z) and, therefore, the integrals along the line segments P\ Q 
and SP 2 cancel. Moreover, j(-l/z) = j(z ) and the integrals along the arcs —QR 
and RS also cancel. (This statement remains true even if we have to modify the 
boundary of the fundamental domain as well.) 

Since j ( r ) = q 2 + 744 + • • * = e~ 2mr + 744 + • • • for sufficiently large a and 
z £ \ia — |, ia + |], we have 


d In(j (z) -c) = d{~2mz + •••) = -2 mdz + •■•'. 

Therefore, 

1 f ia ~5 

lim / dln(j(z) — c) = 1. 

a ~*°° 2m J ia+ i ' 

If c = 0 or 1728, we can proceed as follows. Consider the domain G plotted in 
Figure 45. (If the value c is attained on the boundary, then we have to modify the 
domain as before,) 

The number of zeros of the function j(z) — c belonging to G is equal to 

1 f f( z )dz __ 1 r(i) r(e ) 

27r * JdG j( z ) - c 2 3 ’ 

where r(i ) and r{e) are the multiplicities of the zeros of the function j(z) — c at 
the points i and e, respectively. To prove this statement, it suffices to observe that 
for i we take the integral over the half circle, whereas for the points e and e 2 the 
integral is taken along a third of this circle and the circle is circumvented clockwise 
which leads to the minus sign. 

If c = 0 or 1728, then either r(e) ^ 0 or r(i) ^ 0. In this case the number of 
zeros of j(r) ~ c belonging to G is strictly less than 1, i.e., j(r) ^ c for r £ G. 
Observe also that if c = 0, then r(e) = 3, and if c = 1728, then r(i) = 2. This 
means that the function j has a zero of multiplicity 3 at e, whereas j - 1728 has a 
zero of multiplicity 2 at i. □ 


§7.19. The functions invariant under the 
changes of parameter r t-> r + 1 and r - ~ 

r 

Any function g(r) defined in the upper half plane H and invariant under the 
changes of variables ri->r + l and r t~> — ~ can be represented in the form G(j(r)) 
for some function G. Indeed, as was shown in the preceding section, j determines 
a one-to-one map of D U oo onto CUoo. Therefore, there exists an inverse map 
j 1 : CUoo i * DUoo. Set G(z) = gij'^z)). Then G(j(r)) = g{r l {j{r))) = g(r). 

Now suppose that the function g considered as a function of q — e 7rtT is mero- 
morphic in the disk |g| < 1. Then G is a rational function. Indeed, since g is 
meromorphic, the finite singular points of G can only be poles. The value j - oo 
corresponds to q = 0; hence, by the assumption on g the point oo cannot be an 
essentially singular point for G. Therefore, in C U oo the function G does not have 
singular points other than the poles. Let us subtract from G the principal parts 
of its power series expansions at the singular points. As a result we get a function 
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without singular points in C U oo, i.e., a constant. Since all the principal parts of 
the expansions of G at singular points consist of finitely many terms, we deduce 
that G is a rational function. 

§7.20. The functions invariant with respect to 
the changes of parameter r t + 2 and r \ 

To solve a quintic equation we needed a description of the functions invariant 
with respect to changes of parameter r i—> r + 2, t i—> — \ and r t—■> j. So 
far, we have only described the functions invariant with respect to the changes of 
parameter r t + 1 and r s—► —K Now it is easy to get a description of the 
functions we are interested in. 

Recall that 

(/ 24 (r) - 16) 3 / 7 m (t)-16 \ 3 / 16 __ 16 \ 3 

/ 24 M V / 8 W ) V f S P)J ' 

The function / 24 (r) is invariant with respect to the changes of parameter tht + 2 
and th- 1. In the class of functions stable under these changes it plays the same 
role as the function j(r) plays in the class of functions stable under the changes 
of parameter tht + 1 and r t—► — -. Namely, any function g invariant under 
the changes of parameter r h r + 2 and r (-> -f is a rational function of / 24 
(provided the function g considered as a function of q — e mr is meromorphic in the 
disk |<jr] < 1). This can be proved by the same method that we used in §7.19. We 
have only to prove the following statement. 

7.20.1. Theorem, a) The fundamental domain of the group G\ generated by 
the transformations r i—> r + 2 and r t —* — ~ is 

Di = {r € H j |r| > 1, | Rer| < 1}. 

b) In the domain D\ the function f 24 attains each nonzero value c exactly once. 

Proof. Let r' be the image of r € H under the action of G\ with the maximal 
value of Imp (the existence of such a point r' is proved in §7.17). Since the 
transformation t' t' ± 2 does not change the imaginary part of P, we can 
assume that |ReP| < 1. In this case P e L>i, i.e., |P| > 1. Indeed, since Imp is 
maximal, it follows that 

Imp > Im ^ — ^ = Imr'/lPj 2 ; 

hence, \r'\ > 1. 

We proved that any point r € H can be mapped to a point P G D\ under 
the action of some g € G\. To prove that distinct inner points of D\ cannot be 
transformed into each other under the action of elements from G\ it suffices to 
verify b). Let us do this. 

It is convenient to complete the domain D\ (Figure 46) with the points ±1 
(corresponding to the value q = —1). For these points q we get 

OO 

r = g~ l n (l + g 2 *- 1 ) 24 = 0 

k~l 
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Figure 46 

and this zero is of infinite multiplicity, i.e., for the function j = (/ 16 — 16// 8 ) the 
points r = ±1 are essentially singularities. In the domain D\ the function j has 
no poles, therefore, the function / 24 does not vanish anywhere in D\. Moreover, 
for Im r = oo the values of the functions j and f 24 are equal to oo. It remains to 
consider the nonzero values of / 24 . 



Figure 47 

Let G be the domain plotted in Figure 47. Taking into account the gluings 
shown in Figure 46 we get for c / 0 


dln(f 24 (z) — c) 


'dG 


dln(/ 24 M-c). 




Since / 24 (r) — q~ x + • • • = e“ 7rtT + • • •, it follows that 

dln(/ 24 ( 2 ;) — c) — d(—7riz + •••) = ~7r idz + 

Hence, 


lim 


dln(f 24 (z) - c) = 1. 


a ~*°° 2m J ia+ 1 
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This means that in D\ the function / 24 assumes each nonzero value exactly 
once. □ 


Now, suppose that the function g satisfying the meromorphicity condition is 
invariant not only with respect to the changes of parameter tht + 2 and t — ~ 
but also with respect to the change r i—> Then g(r) = i?(/ 24 (r)), where R is 

a rational function such that -R(/ 24 (yipj)) = i?(/ 24 (r)), i.e., R(j = R(f 24 ). 

Set / 24 (r) = x and 2 12 = a, and let R(x) — J2kL-oo c k% k ■ Then 


R 


OO 

= ^2 C ka k X~ k . 

fc—_oo 


Hence, Cka k = c_fc, so that 


i?(x) — c 0 + 



Considering the differences (t + ~) k — (x k + it is easy to prove by induction 

on k that x k + [f.) k can be polynomially expressed in terms of x + Therefore, 
R(x) — G(x + for some function G. Since R is rational, G has no singular points 
other than poles, i.e., G is a rational function. As a result, we see that the function 
g(r) can be rationally expressed in terms of the function / 24 (r) + p*( r ) • 
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